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Preface 


Norway is a country rich on natural resources. Wind, rain and snow provide us with 
a huge resource for clean energy production, while oil and gas have contributed 
significantly, since the early 1970s, to the country's economic wealth. Nowadays 
the income from oil and gas exploitation is invested in the world's financial markets 
to ensure the welfare of future generations. With the rising global concerns about 
climate, using renewable resources for power generation has become more and 
more important. Bad management of these resources will be a waste that is a 
negligence to avoid given the right tools. 

This formed the background and motivation for the research group Stochastics 
for Environmental and Financial Economics (SEFE) at the Centre of Advanced 
Studies (CAS) in Oslo, Norway. During the academic year 2014—2015, SEFE 
hosted a number of distinguished professors from universities in Belgium, France, 
Germany, Italy, Spain, UK and Norway. The scientific purpose of the SEFE centre 
was to focus on the analysis and management of risk in the environmental and 
financial economics. New mathematical models for describing the uncertain 
dynamics in time and space of weather factors like wind and temperature were 
studied, along with sophisticated theories for risk management in energy, com- 
modity and more conventional financial markets. 

In September 2014 the research group organized a major international confer- 
ence on the topics of SEFE, with more than 60 participants and a programme 
running over five days. The present volume reflects some of the scientific devel- 
opments achieved by CAS fellows and invited speakers at this conference. All the 
14 chapters are stand-alone, peer-reviewed research papers. The volume is divided 
into two parts; the first part consists of papers devoted to fundamental aspects of 
stochastic analysis, whereas in the second part the focus is on particular applications 
to environmental and financial economics. 
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We thank CAS for its generous support and hospitality during the academic year 
we organized our SEFE research group. We enjoyed the excellent infrastructure 
CAS offered for doing research. 


Oslo, Norway Fred Espen Benth 
June 2015 Giulia Di Nunno 
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Part I 
Foundations 


Some Recent Developments in Ambit 
Stochastics 


Ole E. Barndorff-Nielsen, Emil Hedevang, Jürgen Schmiegel 
and Benedykt Szozda 


Abstract Some of the recent developments in the rapidly expanding field of Ambit 
Stochastics are here reviewed. After a brief recall of the framework of Ambit Sto- 
chastics, two topics are considered: (i) Methods of modelling and inference for 
volatility/intermittency processes and fields; (ii) Universal laws in turbulence and 
finance in relation to temporal processes. This review complements two other recent 
expositions. 


Keywords Ambit stochastics * Stochastic volatility/intermittency * Universality - 
Finance * Turbulence - Extended subordination * Metatimes * Time-change 


1 Introduction 


Ambit Stochastics is a general framework for the modelling and study of dynamic 
processes in space-time. The present paper outlines some of the recent developments 
in the area, with particular reference to finance and the statistical theory of turbu- 
lence. Two recent papers [8, 36] provide surveys that focus on other sides of Ambit 
Stochastics. 
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A key characteristic of the Ambit Stochastics framework, which distinguishes this 
from other approaches, is that beyond the most basic kind of random input it also 
specifically incorporates additional, often drastically changing, inputs referred to as 
volatility or intermittency. 

Such "additional" random fluctuations generally vary, in time and/or in space, in 
regard to intensity (activity rate and duration) and amplitude. Typically the volatil- 
ity/intermittency may be further classified into continuous and discrete (i.e. jumps) 
elements, and long and short term effects. In turbulence the key concept of energy 
dissipation is subsumed under that of volatility/intermittency. 

The concept of (stochastic) volatility/intermittency is of major importance in many 
fields of science. Thus volatility/intermittency has a central role in mathematical 
finance and financial econometrics, in turbulence, in rain and cloud studies and 
other aspects of environmental science, in relation to nanoscale emitters, magne- 
tohydrodynamics, and to liquid mixtures of chemicals, and last but not least in the 
physics of fusion plasmas. 

As described here, volatility/intermittency is a relative concept, and its mean- 
ing depends on the particular setting under investigation. Once that meaning is 
clarified the question is how to assess the volatility/intermittency empirically and 
then to describe it in stochastic terms, for incorporation in a suitable probabilistic 
model. Important issues concern the modelling of propagating stochastic volatil- 
ity/intermittency fields and the question of predictability of volatility/intermittency. 

Section2 briefly recalls some main aspects of Ambit Stochastics that are of rel- 
evance for the dicussions in the subsequent sections, and Sect. 3 illustrates some of 
the concepts involved by two examples. The modelling of volatility/intermittency 
and energy dissipation is a main theme in Ambit Stochastics and several approaches 
to this are discussed in Sect. 4. A leading principle in the development of Ambit 
Stochastics has been to take the cue from recognised stylised features—or universal- 
ity traits—in various scientific areas, particularly turbulence, as the basis for model 
building; and in turn to seek new such traits using the models as tools. We discuss 
certain universal features observed in finance and turbulence and indicate ways to 
reproduce them in Sect. 5. Section 6 concludes and provides an outlook. 


2 Ambit Stochastics 


2.1 General Framework 


In terms of mathematical formulae, in its original form [17] (cf. also [16]) an ambit 
field is specified by 


ron su f g(x, E, t, s)o (E, s) L(dé ds) + Q(x, t) (1) 


A(x,t) 
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time 


space 


Fig. 1 A spatio-temporal ambit field. The value Y (x, t) of the field at the point marked by the black 
dot is defined through an integral over the corresponding ambit set A(x, t) marked by the shaded 
region. The circles of varying sizes indicate the stochastic volatility/intermittency. By considering 
the field along the dotted path in space-time an ambit process is obtained 


where 


9o.» | q(x, E, t, s) x (5, s) dé ds. (2) 
D(x,t) 


Here ¢ denotes time while x gives the position in d-dimensional Euclidean space. 
Further, A(x, t) and D(x, t) are subsets of R7 x R and are termed ambit sets, g and q 
are deterministic weight functions, and L denotes a Lévy basis (i.e. an independently 
scattered and infinitely divisible random measure). Further, o and x are stochastic 
fields representing aspects of the volatility/intermittency. In Ambit Stochastics the 
models of the volatility/intermittency fields ø and x are usually themselves specified 
as ambit fields. We shall refer to ø as the amplitude volatility component. Figure | 
shows a sketch of the concepts. 

The development of Y along a curve in space-time is termed an ambit process. 
As will be exemplified below, ambit processes are not in general semimartingales, 
even in the purely temporal case, i.e. where there is no spatial component x. 
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In a recent extension the structure (1) is generalised to 


vi =a f 


be ae Q(x,t) (3) 
x 


where Q is like (2) or the exponential thereof, and where T is a metatime expressing 
a further volatility/intermittency trait. The relatively new concept of metatime is 
instrumental in generalising subordination of stochastic processes by time change 
(as discussed for instance in [22]) to subordination of random measures by random 
measures. We return to this concept and its applications in the next section and refer 
also to the discussion given in [8]. 

Note however that in addition to modelling volatility/intermittency through the 
components o, x and 7, in some cases this may be supplemented by probability 
mixing or Lévy mixing as discussed in [12]. 

It might be thought that ambit sets have no role in purely temporal modelling. 
However, examples of their use in such contexts will be discussed in Sect. 3. 

In many cases it is possible to choose specifications of the volatility/intermittency 
elements c, x and T such that these are infinitely divisible or even selfdecomposable, 
making the models especially tractable analytically. We recall that the importance 
of the concept of selfdecomposability rests primarily on the possibility to represent 
selfdecomposable variates as stochastic integrals with respect to Lévy processes, 
see [32]. 

So far, the main applications of ambit stochastics has been to turbulence and, to 
a lesser degree, to financial econometrics and to bioimaging. An important potential 
area of applications is to particle transport in fluids. 


2.2 Existence of Ambit Fields 


The paper [25] develops a general theory for integrals 
xen = [ h(x, y, t, s) M(dx dx) 
R4 xR 


where h is a predictable stochastic function and M is a dispersive signed random 
measure. Central to this is that the authors establish a notion of characteristic triplet 
of M, extending that known in the purely temporal case. A major problem solved in 
that regard has been to merge the time and space aspects in a general and tractable 
fashion. Armed with that notion they determine the conditions for existence of the 
integral, analogous to those in [37] but considerably more complicated to derive and 
apply. An important property here is that now predictable integrands are allowed 
(in the purely temporal case this was done in [23]). Applications of the theory to 
Ambit Stochastics generally, and in particular to superposition of stochastic volatility 
models, is discussed. 
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Below we briefly discuss how the metatime change is incorporated in the frame- 
work of [25]. Suppose that L = {L(A) | A € By (RIT!) is a real-valued, homo- 
geneous Lévy basis with associated infinitely divisible law u € ID(R4* !), that is 
L([0, 1]2*!) is equal in law to u. Let (y, X, v) be the characteristic triplet of u. 
Thus y € R, X > 0 and v is a Lévy measure on R. 

Suppose that T = {T(A) | A € ZI (R4*) is a random meta-time associated 
with a homogeneous, real-valued, non-negative Lévy basis T — (T(A) | A € 
ZA (R17 )). That is the sets T(A) and T(B) are disjoint whenever A, B € Z(R4*!) 
are disjoint, T(UX gAn) = U%9T(An) whenever An, UX 4A, € A(R!) and 
T(A) = Lebg+1(T(A)) for all A € ZB (R4+!)., Here and in what follows, Leb, 
denotes the Lebesgue measure on IR. For the details on construction of random 
meta-times cf. [11]. Suppose also that A € 7 D(R) is the law associated to T and that 
à ~ I D(B,O, p). Thus £ > 0 and p is a Lévy measure such that p(R_) = 0 and 
fg Ax) p(dx) < oo. 

Now, by [11, Theorem 5.1] we have that Lr = (L(T(A) | A e Z(R4*)) 
is a homogeneous Lévy basis associated to u” with u* ~ I D(y*, X*, v*) and 
characteristics given by 


y* = By «f / ES 
yt = py 


v*(B) = Bv(B) + n p(B)e(ds, Be Z(R^'! \ (0), 
0 


where u’ is given by u° = ji for any s > 0. 

Finally, suppose that o (x, f) is predictable and that Lr has no fixed times of 
discontinuity (see [25]). By rewriting the stochastic integral in the right-hand side of 
(3) as 


xe.» [ H(x,$,t, s) Lr(déds), 
RdH 


with H (x, &, t, s) = lag (E, s)8 (x, £, t, $)o (5, s) we can use [25, Theorem 4.1]. 
Observe that the assumption that o is predictable is enough as both A(x, t) and 
g(x, £, t, s) are deterministic. This gives us that X is well defined for all (x, t) if the 
following hold almost surely for all (x, t) € R^*!: 


p» 


H(x,£, t, s)y* NA é,t, s)y) — H(x, £, t, s)v(y)]lv* (dy)| d&ds < oo 
(4) 


[ H?(x,&,t,s) E* d£ds < oo (5) 
Ra41 


Í i (LA (A(x, £, t, 5)y)? v* (dy)d£ds < oo. (6) 
R4 JR 


8 O.E. Barndorff-Nielsen et al. 


3 Illustrative Examples 


We can briefly indicate the character of some of the points on Ambit Stochastics 
made above by considering the following simple model classes. 


3.1 BSS and LSS Processes 


Stationary processes of the form 


t t 
Y(t) = J g(t — s)o (s) Br (ds) «f q(t — s)o (s)? ds. (7) 


are termed Brownian semistationary processes—or BSS for short. Here the setting 
is purely temporal and B7 is the time change of Brownian motion B by a chronome- 
ter T (that is, an increasing, càdlàg and stochastically continuous process ranging 
from —oo to oo), and the volatility/intermittency process o is assumed stationary. 
The components o and T represent respectively the amplitude and the intensity of 
the volatility/intermittency. If T has stationary increments then the process Y is sta- 
tionary. The process (7) can be seen as a stationary analogue of the BNS model 
introduced by Barndorff-Nielsen and Shephard [14]. 

Note that in case T increases by jumps only, the infinitesimal of the process Br 
cannot be reexpressed in the form x (s) B(ds), as would be the case if T was of type 
T, = fọ V (u) du with x = Jy. 


Further, for the exemplification we take g to be of the gamma type 


v 


~ TO) 


gs) "Tg 1G, a). (8) 


Subject to a weak (analogous to (4)) condition on o, the stochastic integral in (7) 
will exist if and only if v > 1/2 and then Y constitutes a stationary process in time. 
Moreover, Y is a semimartingale if and only if v does not lie in one of the intervals 
(1/2, 1) and (1, 3/2]. Note also that the sample path behaviour is drastically different 
between the two intervals, since, ast — 0, g(t) tends tooo when v c (1/2, 1) and toO 
when v c (1, 3/2]. Further, the sample paths are purely discontinuous if v € (1/2, 1) 
but purely continuous (of Hólder index H = v — 1/2) when v e (1, 3/2). 

Thecases where v € (1/2, 1) haveaparticular bearing in the context of turbulence, 
the value v — 5/6 having a special role in relation to the Kolmogorov-Obukhov 
theory of statistical turbulence, cf. [3, 33]. 

The class of processes obtained by substituting the Brownian motion in (7) by a 
Lévy process is referred to as the class of Lévy semistationary processes—or LSS 
processes for short. Such processes are discussed in [8, 24, Sect. 3.7] and references 
therein. 
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3.2 Trawl Processes 


The simplest non-trivial kind of ambit field is perhaps the trawl process, introduced 
in [2]. In a trawl process, the kernel function and the volatility field are constant 
and equal to 1, and so the process is given entirely by the ambit set and the Lévy 
basis. Specifically, suppose that L is a homogeneous Lévy basis on R? x R and that 
AC R4 x R is a Borel subset with finite Lebesgue measure, then we obtain a trawl 
process Y by letting A(t) = A + (0, t) and 


voc f Ladgds)= f 1.r- 9 Las ds) = LAO) (9) 
t 


The process is by construction stationary. Depending on the purpose of the modelling, 
the time component of the ambit set A may or may not be supported on the negative 
real axis. When the time component of A is supported on the negative real axis, 
we obtain a causal model. Despite their apparent simplicity, trawl processes possess 
enough flexibility to be of use. If L denotes the seed! of L, then the cumulant 
function (i.e. the distinguished logarithm of the characteristic function) of Y is given 
by 

C(z £Y(0) =IAICS EL}. (10) 


Here and later, | A| denotes the Lebesgue measure of the set A. For the mean, variance, 
autocovariance and autocorrelation it follows that 


[Y (7)] = |A| E[L’], 
var(Y (t)) = |A| var(L’), 


r(t) := cov(Y (t), Y (0)) = |A n A(t)| var(L’), (11) 
cov(Y(r), Y(0)) |An AI 
p(t) := = , 
var(Y (0)) |A| 


From this we conclude the following. The one-dimensional marginal distribution 
is determined entirely in terms of the size (not shape) of the ambit set and the 
distribution of the Lévy seed; given any infinitely divisible distribution there exists 
trawl processes having this distribution as the one-dimensional marginal; and the 
autocorrelation is determined entirely by the size of the overlap of the ambit sets, 
that is, by the shape of the ambit set A. Thus we can specify the autocorrelation 
and marginal distribution independently of each other. It is, for example, easy to 
construct a trawl process with the same autocorrelation as the OU process, see [2, 
8] for more results and details. By using integer-valued Lévy bases, integer-valued 


'The Lévy seed L'(x) at x of a Lévy basis L with control measure v is a random variable with the 
property that C{¢ £L(A)] = dy C{¢ L'(x)) v(dx). For a homogeneous Lévy basis, the distribution 
of the seed is independent of x. 
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trawl processes are obtained. These processes are studied in detail in [5] and applied 
to high frequency stock market data. 

We remark, that Y (x, t) = L(A + (x, t)) is an immediate generalisation of trawl 
processes to trawl fields. It has the same simple properties as the trawl process. 

Trawl processes can be used to directly model an object of interest, for example, 
the exponential of the trawl process has been used to model the energy dissipation, 
see the next section, or they can be used as a component in a composite model, for 
example to model the volatility/intermittency in a Brownian semistationary process. 


4 Modelling of Volatility/Intermittency/Energy 
Dissipation 


A very general approach to specifying volatility/intermittency fields for inclusion in 
an ambit field, as in (1), is to take t = o? as being given by a Lévy-driven Volterra 
field, either directly as 


ro. =f f, &,t, s) L(d&, ds) (12) 
R2xR 


with f positive and L a Lévy basis (different from L in (1), or in exponentiated form 


T(x, t) = exp (/ Flr 1,8) Latt, d»). (13) 
R¢xR 


When the goal is to have stationary volatility/intermittency fields, such as in mod- 
elling homogeneous turbulence, that can be achieved by choosing L to be homo- 
geneous and f of translation type. However, the potential in the specifications (12) 
and (13) is much wider, giving ample scope for modelling inhomogeneous fields, 
which are by far the most common, particularly in turbulence studies. Inhomogeneity 
can be expressed both by not having f of translation type and by taking the Lévy 
basis L inhomogeneous. 

In the following we discuss two aspects of the volatility/intermittency mod- 
elling issue. Trawl processes have proved to be a useful tool for the modelling of 
volatility/intermittency and in particular for the modelling of the energy dissipation, 
as outlined in Sect. 4.1. Section 4.2 reports on a recent paper on relative volatil- 
ity/intermittency. In Sect. 4.3 we discuss the applicability of selfdecomposability to 
the construction of volatility/intermittency fields. 
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4.1 The Energy Dissipation 


In [31] it has been shown that exponentials of trawl processes are able to repro- 
duce the main stylized features of the (surrogate) energy dissipation observed for a 
wide range of datasets. Those stylized features include the one-dimensional marginal 
distributions and the scaling and self-scaling of the correlators. 

The correlator of order (p, q) is defined by 


|. E[e(r)^e(t + 5Y!] 
Cra) = EeP Elet + sy] m 


The correlator is a natural analogue to the autocorrelation when one considers a 
purely positive process. In turbulence it is known (see the reference cited in [31]) 
that the correlator of the surrogate energy dissipation displays a scaling behaviour 
for a certain range of lags, 


Cp, a (5) x 579, Tagan «s « Tlarge: (15) 


where t (p, q) is the scaling exponent. The exponent r (1, 1) is the so-called intermit- 
tency exponent. Typical values are in the range 0.1 to 0.2. The intermittency exponent 
quantifies the deviation from Kolmogorov's 1941 theory and emphasizes the role of 
intermittency (i.e. volatility) in turbulence. In some cases, however, the scaling range 
of the correlators can be quite small and therefore it can be difficult to determine the 
value of the scaling exponents, especially when p and q are large. Therefore one also 
considers the correlator of one order as a function of a correlator of another order. In 
this case, self-scaling is observed, i.e., the one correlator is proportional to a power 
of the other correlator, 

Cg I) TN, (16) 


where c (p, q; p', q’) is the self-scaling exponent. The self-scaling exponents have 
turned out to be much easier to determine from data than the scaling exponents, 
and like the scaling exponents, the self-scaling exponents have proved to be key 
fingerprints of turbulence. They are essentially universal in that they vary very little 
from one dataset to another, covering a large range of the so-called Reynolds numbers, 
a dimensionless quantity describing the character of the flow. 

In [31] the surrogate energy dissipation & is, more specifically, modelled as 


e(t) = exp(L(A(r))), (17) 


where L is a homogeneous Lévy basis on R x IR and A(t) = A+ (0, t) for a bounded 
set A C R x R. The ambit set A is given as 


A—Í(( D) eRxRJIOxt < Tage. —f(0 € x S FO}, (18) 
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1 1 1 1 i 1 1 1 1 i 1 1 1 i 1 i i 1 1 1 i a 
- 1.0 -0.5 0.0 0.5 1.0 
x 


Fig. 2 The shaded region marks the ambit set A from (18) defined by (19) where the parameters 
are chosen to be Tlarge = 1, Tama] = 0.1 and @ = 5 


where Tlarge 0. For Tlarge > Tsmal] > 0 and 0 > 0, the function f is defined as 


Ies (t/ Targo)” di 


The shape of the ambit set is chosen so that the scaling behaviour (15) of the corre- 
lators is reproduced. The exact values of the scaling exponents are determined from 
the distribution of the Lévy seed of the Lévy basis. The two parameters T;,4j] and 
Tlarge determine the size of the small and large scales of turbulence: in between we 
have the inertial range. The final parameter 0 is a tuning parameter which accounts 
for the lack of perfect scaling and essentially just allows for a better fit. (Perfect 
scaling is obtained in the limit 0 — oo). See Fig.2 for an example. Furthermore, 
self-scaling exponents are predicted from the shape, not location and scale, of the 
one-point distribution of the energy dissipation alone. 

To determine a proper distribution of the Lévy seed of L, it is in [31] shown 
that the one-dimensional marginal of the logarithm of the energy dissipation is well 
described by a normal inverse Gaussian distribution, i.e. log e(t) ~ NIG(o, P, u, ô), 
where the shape parameters o and f are the same for all datasets (independent of 
the Reynolds number). Thus the shape of the distribution of the energy dissipation 
is a newly discovered universal feature of turbulence. Thus we see that L should be 
a normal inverse Gaussian Lévy basis whose parameters are given by the observed 
distribution of log e(t). This completely specifies the parameters of (17). 
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4.2 Realised Relative Volatility/Intermittency/Energy 
Dissipation 


By its very nature, volatility/intermittency is a relative concept, delineating varia- 
tion that is relative to a conceived, simpler model. But also in a model for volatil- 
ity/intermittency in itself it is relevant to have the relative character in mind, as 
will be further discussed below. We refer to this latter aspect as relative volatil- 
ity/intermittency and will consider assessment of that by realised relative volatil- 
ity/intermittency which is defined in terms of quadratic variation. The ultimate 
purpose of the concept of relative volatility/intermittency is to assess the volatil- 
ity/intermittency or energy dissipation in arbitrary subregions of a region C of space- 
time relative to the total volatility/intermittency/energy dissipation in C. In the purely 
temporal setting the realised relative volatility/intermittency is defined by 


[Ys]; /[Ys] (20) 


where [Ys]; denotes the realised quadratic variation of the process Y observed with 
lag ô over a time interval [0, t]. We refer to this quantity as RRQV (for realised 
relative quadratic volatility). 

As mentioned in Example 1, in case g is the gamma kernel (8) with v € (1/2, 1)U 
(1, 3/2] then the BSS process (7) is not a semimartingale. In particular, if v € 
(1/2, 1)—the case of most interest for the study of turbulence—the realised quadratic 
variation [Ys], does not converge as it would if Y was a semimartingale; in fact it 
diverges to infinity whereas in the semimartingale case it will generally converge to 
the accumulated volatility/intermittency 


t 
o2* = Í c? ds, Q1) 


which is an object of key interest (in turbulence it represents the coarse-grained 
energy dissipation). However the situation can be remedied by adjusting [Ys]; by a 
factor depending on v; in wide generality it holds that 


= P 
c? IY], — o2* (22) 


as 8 — 0, with x = 47120-12 (rw) + FW + 1/2))/F v — 1)r(3/2— 1). To 
apply this requires knowledge of the value of v and in general v must be estimated with 
sufficient precision to ensure that substituting the estimate for the theoretical value of 
v in (22) will still yield convergence in probability. Under relatively mild conditions 
that is possible, as discussed in [26] and the references therein. An important aspect 
of formula (20) is that its use does not involve knowledge of v as the adjustment 
factor cancels out (Fig. 3). 

Convergence in probability and a central limit theorem for the RRQV is estab- 
lished in [10]. Figure 2 illustrates its use, for two sections ofthe “Brookhaven” dataset, 
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Fig. 3 Brookhaven turbulence data periods 18 and 25—RRQV and 95 % confidence intervals 


one where the volatility effect was deemed by eye to be very small and one where 
it appeared strong. (The *Brookhaven" dataset consist of 20 million one-point mea- 
surements of the longitudinal component of the wind velocity in the atmospheric 
boundary layer, 35m above ground. The measurements were performed using a 
hot-wire anemometer and sampled at 5 kHz. The time series can be assumed to be 
stationary. We refer to [27] for further details on the dataset; the dataset is called 
no. 3 therein). 


4.3 Role of Selfdecomposability 


If t is given by (12) it is automatically infinitely divisible, and selfdecomposable 
provided L has that property; whereas if t is defined by (13) it will only in exceptional 
cases be infinitely divisible. 

A non-trivial example of such an exceptional case is the following. The Gumbel 
distribution with density 


4 X—a X—a 23 
fla) = zew( 5 exp ( b )). (23) 


where a € R and b > O is infinitely divisible [39]. In [31] it was demonstrated that 
the one-dimensional marginal distribution of the logarithm of the energy dissipation 
is accurately described by a normal inverse Gaussian distribution. One may also 
show (not done here) that the Gumbel distribution with b — 2 provides another fit 
that is nearly as accurate as the normal inverse Gaussian. Furthermore, if X is a 
Gumbel random variable with b = 2, then exp(X) is distributed as the square of 
an exponential random variable, hence also infinitely divisible by [39]. Therefore, if 
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the Lévy basis L in (17) is chosen so that L(A) follows a Gumbel distribution with 
b = 2, then exp(L(A(t))) will be infinitely divisible. 

For a general discussion of selfdecomposable fields we refer to [13]. See also [32] 
which provides a survey of when a selfdecomposable random variable can be rep- 
resented as a stochastic integral, like in (12). Representations of that kind allow, in 
particular, the construction of field-valued processes of OU or supOU type that may 
be viewed as propagating, in time, an initial volatility/intermittency field defined on 
the spatial component of space-time for a fixed time, say t = 0. Similarly, suppose 
that a model has been formulated for the time-wise development of a stochastic field 
at a single point in space. One may then seek to define a field on space-time such that 
at every other point of space the time-wise development of the field is stochastically 
the same as at the original space point and such that the field as a whole is stationary 
and selfdecomposable. 


Example I (One dimensional turbulence) Let Y denote an ambit field in the tempo- 
spatial case where the spatial dimension is 1, and assume that for a preliminary purely 
temporal model X of the same turbulent phenomenon a model has been formulated 
for the squared amplitude volatility component, say w. It may then be desirable 
to devise Y such that the volatility/intermittency field t = o? is stationary and 
infinitely divisible, and such that for every spatial position x the law of t(x, - ) is 
identical to that formulated for the temporal setting, i.e. w. If the temporal process 
is selfdecomposable then, subject to a further weak condition (see [13]), such a field 
can be constructed. 

To sketch how this may proceed, recall first that the classical definition of self- 
decomposability of a process X says that all the finite-dimensional marginal distri- 
butions of X should be selfdecomposable. Accordingly, due to a result by [38], for 
any finite set of time points 7 = (u1, ..., Un) the selfdecomposable vector variable 
X (à) = (X (u1), ..., X (un)) has a representation 


X(a) = ru e 5 L(d£, û) 
0 


for some n-dimensional Lévy process L(- , û), provided only that the Lévy measure 
of X (à) has finite log-moment. We now assume this to be the case and that X is 
stationary 

Next, for fixed û, let (L(x, û) | x € R}, be the n-dimensional Lévy process having 
the property that the law of L(1, û) is equal to the law of X (å). Then the integral 


X(x, i) = n e 5 L(d£, it) 


—oo 


exists and the process (X (x, à) | x € R} will be stationary—of Ornstein-Uhlenbeck 
type—while for each x the law of X (x, 4) will be the same as that of X (ù). 
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However, off hand the Lévy processes L(-,d) corresponding to different sets a 
of time points may have no dynamic relationship to each other, while the aim is to 
obtain a stationary selfdecomposable field X (x, t) such that X (x, - ) has the same 
law as X forall x € R. But, arguing along the lines of theorem 3.4 in [9], itis possible 
to choose the representative processes L(-, û) so that they are all defined on a single 
probability space and are consistent among themselves (in analogy to Kolmogorov’s 
consistency result); and that establishes the existence of the desired field X (x, t). 
Moreover, X(-, -) is selfdecomposable, as is simple to verify. 

The same result can be shown more directly using master Lévy measures and the 
associated Lévy-Ito representations, cf. [13]. 


Example 2 Assume that X has the form 


xod J gu — £) L(dé) 24) 


where L is a Lévy process. 

It has been shown in [13] that, in this case, provided g is integrable with respect 
to the Lebesgue measure, as well as to L, and if the Fourier transform of g is non- 
vanishing, then X, as a process, is selfdecomposable if and only if L is selfdecompos- 
able. When that holds we may, as above, construct a selfdecomposable field X (x, t) 
with X (x, -) ~ X(-) for every x € R and X( - , t) of OU type for every t € R. 

As an illustration, suppose that g is the gamma kernel (8) with v € (1/2, 1). Then 
the Fourier transform of g is 


86)-üu-ic/A) *. 


and hence, provided that L is such that the integral (24) exists, the field X (x, u) is 
stationary and selfdecomposable, and has the OU type character described above. 


5 Time Change and Universality in Turbulence 
and Finance 


5.1 Distributional Collapse 


In [4], Barndorff-Nielsen et al. demonstrate two properties of the distributions of 
increments A; X (t) = X (t) — X (t — £) of turbulent velocities. Firstly, the increment 
distributions are parsimonious, i.e., they are described well by a distribution with 
few parameters, even across distinct experiments. Specifically it is shown that the 
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four-parameter family of normal inverse Gaussian distributions (NIG(q, P, u, 5)) 
provides excellent fits across a wide range of lags £, 


AcX ~ NIG(a(£), pŒ), u£), 8(£)). (25) 


Secondly, the increment distributions are universal, i.e., the distributions are the same 
for distinct experiments, if just the scale parameters agree, 


Ap, X1 ~ Ae, X» ifandonly if 4, (€1) = 62(£2), (26) 


provided the original velocities (not increments) have been non-dimensionalized 
by standardizing to zero mean and unit variance. Motivated by this, the notion of 
stochastic equivalence class is introduced. 

The line of study initiated in [4] is continued in [16], where the analysis is extended 
to many more data sets, and it is observed that 


Ae, X1 ~ Ae, X2 if and only if var(Ag, X1) ~ var(Ae, X2), (27) 


which is a simpler statement than (26), since it does not involve any specific distri- 
bution. In [21], Barndorff-Nielsen et al. extend the analysis from fluid velocities in 
turbulence to currency and metal returns in finance and demonstrate that (27) holds 
when X; denotes the log-price, so increments are log-returns. Further corrobora- 
tion of the existence of this phenomenon in finance is presented in the following 
subsection. 

A conclusion from the cited works is that within the context of turbulence or 
finance there exists a family of distributions such that for many distinct experiments 
and a wide range of lags, the corresponding increments are distributed according 
to a member of this family. Moreover, this member is uniquely determined by the 
variance of the increments. 

Up till recently these stylised features had not been given any theoretical back- 
ground. However, in [20], a class of stochastic processes is introduced that exactly 
has the rescaling property in question. 


5.2 A First Look at Financial Data from SP500 


Motivated by the developments discussed in the previous subsection, in the following 
we complement the analyses in [4, 16, 21] with 29 assets from Standard & Poor's 
500 stock market index. The following assets were selected for study: AA, AIG, 
AXP, BA, BAC, C, CAT, CVX, DD, DIS, GE, GM, HD, IBM, INTC, JNJ, JPM, KO, 
MCD, MMM, MRK, MSFT, PG, SPY, T, UTX, VZ, WMT, XOM. For each asset, 
between 7 and 12 years of data is available. A sample time series of the log-price of 
asset C is displayed in Fig.4, where the thin vertical line marks the day 2008-01-01. 
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Fig.4 Time series of the log-price of asset C on an arbitrary scale. The thin vertical line marks the 
day 2008-01-01 and divides the dataset into the two subsets “pre” (blue) and “post” (yellow) 


Asset C is found to be representative of the feature of all the other datasets. Each 
dataset is divided into two subsets: the “pre” subset consisting of data from before 
2008-01-01 and the "post" subset consisting of data from after 2008-01-01. This 
subdivision was chosen since the volatility in the “post” dataset is visibly higher 
than in the “pre” dataset, presumably due to the financial crisis. The data has been 
provided by Lunde (Aarhus University), see also [29]. 

Figure5 shows that the distributions of log-returns across a wide range of lags 
ranging from 15s to approximately 4.5h are quite accurately described by normal 
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Fig.5 Probability densities on a log-scale for the log-returns of asset C at various lags ranging from 
1 to 16384s. The dots denote the data and the solid line denotes the fitted NIG distribution. Blue 
and yellow denote the “pre” and “post” datasets, respectively. The log-returns have been multiplied 
with 100in order to un-clutter the labeling of the x-axes 
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inverse Gaussian distributions, except at the smallest lags where the empirical dis- 
tributions are irregular. We suspect this is due to market microstructure noise. The 
accuracy of the fits is not surprising given that numerous publications have demon- 
strated the applicability of the generalised hyperbolic distribution, in particular the 
subfamily consisting of the normal inverse Gaussian distributions, to describe finan- 
cial datasets. See for example [1, 14, 15, 28, 40]. We note the transition from a 
highly peaked distribution towards the Gaussian as the lag increases. 

Next, we see on Fig.6 that the distributions at the same lag of the log-returns for 
the 29 assets are quite different, that is, they do not collapse onto the same curve. This 
holds for both the “pre” and the “post” datasets. However, the transition from a highly 
peaked distribution at small lags towards a Gaussian at large lags hints that a suitable 
change of time, though highly nonlinear, may cause such a collapse. Motivated by 
the observations in [21] we therefore consider the variance of the log-returns as a 
function of the lag. Figure 7 shows how the variance depends on the lag. Except at the 
smallest lags, a clear power law is observed. The behaviour at the smallest lags is due 
to market microstructure noise [29]. Nine variances have been selected to represent 
most of the variances observed in the 29 assets. For each selected variance and each 
asset the corresponding lag is computed. We note that for the smallest lags/variances 
this is not without difficulty since for some of the assets the slope approaches zero. 

Finally, Fig.8 displays the distributions of log-returns where the lag for each 
asset has been chosen such that the variance is the one specified in each subplot. The 
difference between Figs. 6 and 8 is pronounced. We see that for both the “pre” and 
the "post" dataset, the distributions corresponding to the same variance tend to be 
the same. Furthermore, when the “pre” and “post” datasets are displayed together, 
essentially overlaying the top part of Fig. 8 with the bottom part, a decent overlap is 
still observed. So while the distributions in Fig.8 do not collapse perfectly onto the 
same curve for all the chosen variances, in contrast to what is the case for velocity 
increments in turbulence (see [4]), we are invariably led to the preliminary conclusion 
that also in the case of the analysed assets from S&P500, a family of distributions 
exists such that all distributions of log-returns are members of this family and such 
that the variance of the log-returns uniquely determines this member. The lack of 
collapse at the smaller variances may in part be explained by the difficulty in reading 
off the corresponding lags. 

The observed parsimony and in particular universality has implications for mod- 
elling since any proper model should possess both features. Within the context of 
turbulence, BSS-processes have been shown to be able to reproduce many key fea- 
tures of turbulence, see [35] and the following subsection for a recent example. 
The extent to which BSS-processes in general possess universality is still ongoing 
research [20] but results indicate that BSS-processes and in general LSS-process are 
good candidates for models where parsimony and universality are desired features. 
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Fig. 6 Probability densities on a log-scale for the log-returns of all 29 assets at various lags ranging 
from 1 to 16384s. The top and bottom halfs represent the “pre” and “post” datasets, respectively. 
The log-returns have been multiplied with 100 in order to un-clutter the labeling of the x-axes 
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Fig. 7 The variance of the log-returns for the 29 assets as a function of the lag displayed in 
a double logarithmic representation. The top and bottom graphs represent the "pre" and “post” 
datasets, respectively 


5.3 Modelling Turbulent Velocity Time Series 


A specific time-wise version of (1), called Brownian semistationary processes has 
been proposed in [18, 19] as a model for turbulent velocity time series. It was shown 
that BSS processes in combination with continuous cascade models (exponentials of 
certain trawl processes) are able to qualitatively capture some main stylized features 
of turbulent time series. 
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Fig. 8 Probability densities on a log-scale of log-returns where the lag for each asset has been 
chosen such that the variances of the assets in each subplot is the same. The chosen variances are 
also displayed in Fig. 7 as horizonal lines. For the smallest and largest variances, not all dataset are 
present since for some datasets those variances are not attained. Top "pre", bottom “post” 
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Recently this analysis has been extended to a quantitative comparison with turbu- 
lent data [31, 35]. More specifically, based on the results for the energy dissipation 
oulined in Sect. 4.1, BSS processes have been analyzed and compared in detail to 
turbulent velocity time series in [35] by directly estimating the model parameters 
from data. Here we briefly summarize this analysis. 

Time series of the main component v; of the turbulent velocity field are modelled 
as a BSS process of the specific form 


t t 
v(t) = v(t; g,0, p) =f s0- eG) Bas) | g(t — s)o (s)? ds =: R(t) + S(t) 

oo oo (28) 
where g is a non-negative L? (R+) function, ø is a stationary process independent 
of B, $ isa constant and B denotes standard Brownian motion. An argument based 
on quadratic variation shows that when g(04-) 0, then o? can be identified with 
the surrogate energy dissipation, o? = e, where e is the process given by (17). The 
kernel g is specified as a slightly shifted convolution of gamma kernels [30], 


g(t) = golt + to), 
golt) = at" 7l exp(—Agt)1 Fi (vi, vi + v2, (A2 — A1)t)1 (0,00) (t) 


with a > 0, vj > O and A; > 0. Here ; F; denotes the Kummer confluent hypergeo- 
metric function. The shift is needed to ensure that g(0+) Æ 0. 

The data set analysed consists of one-point time records of the longitudinal (along 
the mean flow) velocity component in a gaseous helium jet flow with a Taylor 
Reynolds number R; — 985. The same data set is also analyzed in [31] and the 
estimated parameters there are used to specify o? = e in (28). The remaining para- 
meters for the kernel g and the constant 6 can then be estimated from the second 
and third order structure function, that 1s, the second and third order moments of 
velocity increments. In [35] it is shown that the second order structure function is 
excellently reproduced and that the details of the third order structure function are 
well captured. It is important to note that the model is completely specified from the 
energy dissipation statistics and the second and third order structure functions. 

The estimated model for the velocity is then succesfully compared with other 
derived quantities, including higher order structure functions, the distributions of 
velocity increments and their evolution as a function of lag, the so-called Kolmogorov 
variable and the energy dissipation, as prediced by the model. 


6 Conclusion and Outlook 


The present paper highlights some of the most recent developments in the theory 
and applications of Ambit Stochastics. In particular, we have discussed the existence 
of the ambit fields driven by metatime changed Lévy bases, selfdecomposability of 
random fields [13], applications of BSS processes in the modelling of turbulent time 
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series [35] and new results on the distributional collapse in financial data. Some of 
the topics not mentioned here but also under development are the integration theory 
with respect to time-changed volatility modulated Lévy bases [7]; integration with 
respect to volatility Gaussian processes in the White Noise Analysis setting in the 
spirit of [34] and extending [6]; modelling of multidimensional turbulence based 
on ambit fields; and in-depth study of parsimony and universality in BSS and LSS 
processes motivated by some of the discussions in the present paper. 


Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 
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Abstract First, we revisit basic theory of functional Itó/path-dependent calculus, 
using the formulation of calculus via regularization. Relations with the corresponding 
Banach space valued calculus are explored. The second part of the paper is devoted 
to the study of the Kolmogorov type equation associated with the so called window 


Brownian motion, called path-dependent heat equation, for which well-posedness at 


the level of strict solutions is established. Then, a notion of strong approximating 


solution, called strong-viscosity solution, is introduced which is supposed to be a 


substitution tool to the viscosity solution. For that kind of solution, we also prove 


existence and uniqueness. 
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1 Introduction 


The present work collects several results obtained in the papers [9, 10], focusing 
on the study of some specific examples and particular cases, for which an ad hoc 
analysis is developed. This work is an improved version of [8], trying to explain more 
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precisely some details. For example, in [8] a slightly more restrictive definition of 
strong-viscosity solution was adopted, see Remark 12. 

Recently, a new branch of stochastic calculus has appeared, known as functional 
Itó calculus, which results to be an extension of classical Itó calculus to functionals 
depending on the entire path of a stochastic process and not only on its current value, 
see Dupire [17], Cont and Fournié [5-7]. Independently, Di Girolami and Russo, and 
more recently Fabbri, Di Girolami, and Russo, have introduced a stochastic calculus 
via regularizations for processes taking values in a separable Banach space B (see 
[12-16], including the case B = C([—T,0]), which concerns the applications to 
the path-dependent calculus. 

In the first part of the present paper, we follow [9] and revisit functional Itó cal- 
culus by means of stochastic calculus via regularization. We recall that Cont and 
Fournié [5—7] developed functional Itó calculus and derived a functional Itó's for- 
mula using discretization techniques of Fóllmer [23] type, instead of regularization 
techniques, which in our opinion, better fit to the notion of derivative. Let us illus- 
trate another difference with respect to [5]. One of the main issues of functional Itó 
calculus is the definition of the functional (or pathwise) derivatives, i.e., the hori- 
zontal derivative (calling in only the past values of the trajectory) and the vertical 
derivative (calling in only the present value of the trajectory). In [5], it is essential 
to consider functionals defined on the space of càdlàg trajectories, since the def- 
inition of functional derivatives necessitates of discontinuous paths. Therefore, if 
a functional is defined only on the space of continuous trajectories (because, e.g., 
it depends on the paths of a continuous process as Brownian motion), we have to 
extend it anyway to the space of càdlàg trajectories, even though, in general, there is 
no unique way to extend it. In contrast to this approach, we introduce a new space 
larger than the space of continuous trajectories C ([—T, 0]), denoted by € ([—T, 0]), 
which allows us to define functional derivatives. € ([—T, 0]) is the space of bounded 
trajectories on [—T, 0], continuous on [—7, O[ and with possibly a jump at 0. We 
endow € ([—T, 0]) with a topology such that C([—T, 0]) is dense in & ([—T, 0]) with 
respect to this topology. Therefore, any functional Y : [0, T] x C([-T,0]) —> R, 
continuous with respect to the topology of ([—T, 0]), admits a unique extension 
to € ([—T, 0]), denoted u: [0, T] x @([—T, 0]) — R. We present some significant 
functionals for which a continuous extension exists. Then, we develop the functional 
Itó calculus for u: [0, T] x €([—T,0]) > R. 

Notice that we use a slightly different notation compared with [5]. In partic- 
ular, in place of a map 7 : [0, T] x C([-7,0]) — R, in [5] a family of maps 
F = (Fi)yiepo,ri, with F;: C([0, t]) — R, is considered. However, we can always 
move from one formulation to the other. Indeed, given F = (F;);e[o, r], where each 
F,: C([0, t]) — R, we can define Y : [0, T] x C([—T, 0]) — R as follows: 


Ut, n) := Boer Diag  @, n) € 0, T] x C(E-T, 0). 
Vice-versa, let Y : [0, T] x C([-T, 0]) — R and define F = (F;)iepo,; as 


F) = Vm. (t, ñ) € [0, T] x C((0, rp, (1) 
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where 7 is the element of C([—T, 0]) obtained from ñ firstly translating 7 on the 
interval [—f, 0], then extending it in a constant way up to —7, namely n(x) :— 
n(x + Hla) + 901p. r5), for any x € [—T, 0]. Observe that, in 
principle, the map Y contains more information than F, since in (1) we do not take 
into account the values of Y at (t, n) € [0, T] x C({—T, 0]) with 7 not constant 
on the interval [—T, —t]. Despite this, the equivalence between the two notations 
is guaranteed; indeed, when we consider the composition of 7/ with a stochastic 
process, this extra information plays no role. Our formulation has two advantages. 
Firstly, we can work with a single map instead of a family of maps. In addition, 
the time variable and the path have two distinct roles in our setting, as for the time 
variable and the space variable in the classical Itó calculus. This, in particular, allows 
us to define the horizontal derivative independently of the time derivative, so that, 
the horizontal derivative defined in [5] corresponds to the sum of our horizontal 
derivative and of the time derivative. We mention that an alternative approach to 
functional derivatives was introduced in [1]. 

We end the first part of the paper showing how our functional Itó's formula is 
strictly related to the Itó's formula derived in the framework of Banach space valued 
stochastic calculus via regularization, for the case of window processes. This new 
branch of stochastic calculus has been recently conceived and developed in many 
directions in [12, 14—16]; for more details see [13]. For the particular case of window 
processes, we also refer to Theorem 6.3 and Sect. 7.2 in [12]. In the present paper, we 
prove formulae which allow to express functional derivatives in terms of differential 
operators arising in the Banach space valued stochastic calculus via regularization, 
with the aim of identifying the building blocks of our functional Itó's formula with 
the terms appearing in the Itó's formula for window processes. 

Dupire [17] introduced also the concept of path-dependent partial differential 
equation, to which the second part of the present paper is devoted. Di Girolami and 
Russo, in Chap.9 of [13], considered existence of regular solutions associated with 
a path dependent heat equation (which is indeed the Kolmogorov equation related to 
window Brownian motion) with a Fréchet smooth final condition. This was performed 
in the framework of Banach space valued calculus, for which we refer also to [22]. 
A flavour of the notion of regular solution in the Banach space framework, appeared 
in Chap. IV of [30] which introduced the notion of weak infinitesimal generator (in 
some weak sense) of the window Brownian motion and more general solutions of 
functional dependent stochastic differential equations. Indeed, the monograph [30] 
by Mohammed constitutes an excellent early contribution to the theory of this kind 
of equations. 

We focus on semilinear parabolic path-dependent equations associated to the 
window Brownian motion. For more general equations we refer to [9] (for strict 
solutions) and to [10] (for strong-viscosity solutions). First, we consider regular 
solution, which we call strict solutions, in the framework of functional Itó calculus. 
We prove a uniqueness result for this kind of solution, showing that, if a strict solution 
exists, then it can be expressed through the unique solution to a certain backward 
stochastic differential equation (BSDE). Then, we prove an existence result for strict 
solutions. 
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However, this notion of solution turns out to be unsuitable to deal with all signif- 
icant examples. As a matter of fact, if we consider the path-dependent PDE arising 
in the hedging problem of lookback contingent claims, we can not expect too much 
regularity of the solution (this example is studied in detail in Sect. 3.2). Therefore, 
we are led to consider a weaker notion of solution. In particular, we are interested in a 
viscosity-type solution, namely a solution which is not required to be differentiable. 

The issue of providing a suitable definition of viscosity solutions for path- 
dependent PDEs has attracted a great interest, see Peng [33] and Tang and Zhang 
[42], Ekren et al. [18-20], Ren et al. [34]. In particular, the definition of viscos- 
ity solution provided by [18—20, 34] is characterized by the fact that the classical 
minimum/maximum property, which appears in the standard definition of viscosity 
solution, is replaced with an optimal stopping problem under nonlinear expecta- 
tion [21]. Then, probability plays an essential role in this latter definition, which can 
not be considered as a purely analytic object as the classical definition of viscosity 
solution is; it is, more properly, a probabilistic version of the classical definition of 
viscosity solution. We also emphasize that a similar notion of solution, called sto- 
chastic weak solution, has been introduced in the recent paper [29] in the context of 
variational inequalities for the Snell envelope associated to a non-Markovian con- 
tinuous process X. Those authors also revisit functional Itó calculus, making use of 
stopping times. This approach seems very promising. Instead, our aim is to provide a 
definition of viscosity type solution, which has the peculiarity to be a purely analytic 
object; this will be called a strong-viscosity solution to distinguish it from the clas- 
sical notion of viscosity solution. A strong-viscosity solution to a path-dependent 
partial differential equation is defined, in a few words, as the pointwise limit of strict 
solutions to perturbed equations. We notice that the definition of strong-viscosity 
solution is similar in spirit to the vanishing viscosity method, which represents one 
of the primitive ideas leading to the conception of the modern definition of viscosity 
solution. Moreover, it has also some similarities with the definition of good solution, 
which turned out to be equivalent to the definition of L? -viscosity solution for certain 
fully nonlinear partial differential equations, see, e.g., [3, 11, 27, 28]. Finally, our 
definition is likewise inspired by the notion of strong solution (which justifies the first 
word in the name of our solution), as defined for example in [2, 24, 25], even though 
strong solutions are required to be more regular (this regularity is usually required 
to prove uniqueness of strong solutions, which for example in [24, 25] is based on 
a Fukushima-Dirichlet decomposition). Instead, our definition of strong-viscosity 
solution to the path-dependent semilinear Kolmogorov equation is not required to 
be continuous, as in the spirit of viscosity solutions. The term viscosity in the name 
of our solution is also justified by the fact that in the finite dimensional case we 
have an equivalence result between the notion of strong-viscosity solution and that 
of viscosity solution, see Theorem 3.7 in [8]. We prove a uniqueness theorem for 
strong-viscosity solutions using the theory of backward stochastic differential equa- 
tions and we provide an existence result. We refer to [10] for more general results 
(when the path-dependent equation is not the path-dependent heat equation) and 
also for the application of strong-viscosity solutions to standard semilinear parabolic 
PDEs. 
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The paper is organized as follows. In Sect. 2 we develop functional Itó calculus via 
regularization following [9]: after a brief introduction on finite dimensional stochastic 
calculus via regularization in Sect. 2.1, we introduce and study the space @ ([—T, 0]) 
in Sect. 2.2; then, we define the pathwise derivatives and we prove the functional 
Itó's formula in Sect. 2.3; in Sect. 2.4, instead, we discuss the relation between func- 
tional Itó calculus via regularization and Banach space valued stochastic calculus 
for window processes. In Sect.3, on the other hand, we study path-dependent PDEs 
following [10]. More precisely, in Sect.3.1 we discuss strict solutions; in Sect. 3.2 
we present a significant hedging example to motivate the introduction of a weaker 
notion of solution; finally, in Sect. 3.3 we provide the definition of strong-viscosity 
solution. 


2 Functional Itó Calculus: A Regularization Approach 


2.1 Background: Finite Dimensional Calculus 
via Regularization 


The theory of stochastic calculus via regularization has been developed in several 
papers, starting from [37, 38]. We recall below only the results used in the present 
paper, and we refer to [40] for a survey on the subject. We emphasize that integrands 
are allowed to be anticipating. Moreover, the integration theory and calculus appear 
to be close to a pure pathwise approach even though there is still a probability space 
behind. 

Fix a probability space (2, F, P) and T €]0, oo[. Let F = (.Z;);epo, rj denote a 
filtration satisfying the usual conditions. Let X = (X;);eo,7] (resp. Y = (Yo)repo,r]) 
be a real continuous (resp. P-a.s. integrable) process. Every real continuous process 
X = (Xpi)iepo,r] is naturally extended to all t € R setting X; = Xo, t < 0, and 
X, = Xr, t > T. We also define a C([—T, 0])-valued process X = (X;);cm, called 
the window process associated with X, defined by 


X; := {Xi4x, x € [-T, 0]}, t € R. (2) 


This corresponds to the so-called segment process which appears for instance in [43]. 


Definition 1 Suppose that, for every t € [0, T], the limit 


t f X = X 
] rox, = lim | pds, (3) 
0 e>0t Jo € 


exists in probability. If the obtained random function admits a continuous modifi- 
cation, that process is denoted by Jo Yd" X and called forward integral of Y with 
respect to X. 


32 A. Cosso and F. Russo 


Definition 2 A family of processes (Heo. ri is said to converge to (H;)+e[0,7] 
in the ucp sense, if supo, 7 | Hf — H,| goes to 0 in probability, as e > OT. 


Proposition 1 Suppose that the limit (3) exists in the ucp sense. Then, the forward 
integral o Yd X ofY with respect to X exists. 


Let us introduce the concept of covariation, which is a crucial notion in stochastic 
calculus via regularization. Let us suppose that X, Y are continuous processes. 


Definition 3 The covariation of X and Y is defined by 


EM 
[E Yl eh ia J (Xue Xek 1 € [0,7], 
0 


e>0t € 


if the limit exists in probability for every t € [0, T], provided that the limiting random 
function admits a continuous version (this is the case if the limit holds in the ucp 
sense). If X = Y, X is said to be a finite quadratic variation process and we set 
[X] := [X, X]. 


The forward integral and the covariation generalize the classical Itó integral and 
covariation for semimartingales. In particular, we have the following result, for a 
proof we refer to, e.g., [40]. 


Proposition 2 The following properties hold. 


(i) Let S', S? be continuous F-semimartingales. Then, [S!, S?] is the classical 
bracket [S!, S?] = (M1, M?), where M! (resp. M?) is the local martingale 
part of S! (resp. S?). 

(ii) Let V be a continuous bounded variation process and Y be a càdlàg process 
(or vice-versa); then [V] — [Y, V] — 0. Moreover li Yd V = li YdV, is the 
Lebesgue-Stieltjes integral. 

(üi) IfW isa Brownian motion and Y is an F-progressively measurable process such 
iy T Y2ds « oo, P-a.s., then Ti Yd W exists and equals the Itó integral 

o Y4W. 


We could have defined the forward integral using limits of non-anticipating Riemann 
sums. Another reason to use the regularization procedure is due to the fact that it 
extends the Itó integral, as Proposition 2(iii) shows. If the integrand had uncountable 
jumps (as Y being the indicator function of the rational number in [0, 1]) then, 
the Itó integral ho YdW would be zero Y = O0 a.e. The limit of Riemann sums 
>i Y; (Wi, — Wy) would heavily depend on the discretization grid. 

We end this crash introduction to finite dimensional stochastic calculus via regu- 
larization presenting one of its cornerstones: Itó's formula. Itis a well-known result in 
the theory of semimartingales, but it also extends to the framework of finite quadratic 
variation processes. For a proof we refer to Theorem 2.1 of [39]. 
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Theorem 1 Let F : [0, T] x R — R be of class C? ([0, T] x R) and X = 
(Xi)rejo, r] be a real continuous finite quadratic variation process. Then, the follow- 
ing Itó's formula holds, P-a.s., 


t 


t 
F(t, Xy) = F(0, xo) + | 9; F(s, xods  [ QyF(s,X,)d X; 
0 0 


1 $ 
+5 | a2, F(s, X [X]. O<1<T. @) 


2.1.1 The Deterministic Calculus via Regularization 


A useful particular case of finite dimensional stochastic calculus via regularization 
arises when €2 is a singleton, i.e., when the calculus becomes deterministic. In addi- 
tion, in this deterministic framework we will make use of the definite integral on 
an interval [a, b], where a < b are two real numbers. Typically, we will consider 
a = —T ora = —t and b = 0. 

We start with two conventions. By default, every bounded variation function 
f: [a,b] > R will be considered as càdlàg. Moreover, given a function f : [a,b] > 
R, we will consider the following two extensions of f to the entire real line: 


0 xb, f(b), x» b, 
fix) = 4 f(x), x € [a,b], f7@) = 4 f@), x € [a, 5], 
f(a), x <a, 0, x <a, 


where J :— ]a, b] and J = [a, b[. 


Definition 4 Let f, g: [a, b] — R be càdlàg functions. 
(3) Suppose that the following limit 


f g(s)d~ f(s) = lim n P Fi tui AM LE (5) 
[a,b] e>0t JR € 


exists and it is finite. Then, the obtained quantity is denoted by Jia b] gd f and called 
(deterministic, definite) forward integral of g with respect to f (on |a, b]). 
(ii) Suppose that the following limit 


€ 


f g(s)dt f(s) := lim n PELIS Pri ied: (6) 
[a,b] e>0t JR 


exists and it is finite. Then, the obtained quantity is denoted by Ji a.b] gd* f and called 
(deterministic, definite) backward integral of g with respect to f (on [a, b]). 
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The notation concerning this integral is justified since when the integrator f has 
bounded variation the previous integrals are Lebesgue-Stieltjes integrals on [a, b]. 


Proposition 3 Suppose f : [a,b] — R with bounded variation and g: [a,b] > R 
cadlag. Then, we have 


f PROE f at rO ca f gef (1) 
[a,b] [a,b] Ja,b] 
f sGd* f(s) = / £0dW) = 2OFO* f sGdfG). © 
[a,b] [a,b] Ja,b] 


Proof Identity (7). We have 


] 089209, - odi i fs te)ds 
R E € 


aü—6€ 


b — 
«f y, TUA) IS) is. 


€ 


(9) 


The second integral on the right-hand side of (9) gives, by Fubini's theorem, 


b 1 1 
f «e I afty) ds = f ( / eod aro 
a € J]s,(s+£)^b] ]a,b] \E J[av(y—&),y] 


g—0* 


= / g(y )df y). 
la.b] 


The first integral on the right-hand side of (9) goes to g(a) f (a) as € — O*, so the 
result follows. 


Identity (8). We have 


a+eé 


1 äte 
+ JI g(s) f (s)ds. (10) 


The second integral on the right-hand side of (10) goes to g(a) f (a) as e — 0*. The 
first one equals 


b 1 1 sot 
| «o f dfo))as = n (S eds arn = n odf O), 
ate € J]s—e,s] Ja,b] X8 J]y.Cy--e) AD] ]a,5] 


from which the claim follows. 


Functional and Banach Space Stochastic Calculi ... 35 


Let us now introduce the deterministic covariation. 


Definition 5 Let f, g: [a,b] — R be continuous functions and suppose that 0 € 
[a, b]. The (deterministic) covariation of f and g (on [a, b]) is defined by 


| f* 
Lf gl@) = lg, £160 = E 3 (f(s e) — f(s))(g(s +£) -e(sDds, — x € [a,b], 


if the limit exists and it is finite for every x € [a, b]. If f = g, we set [f] := [f. f] 
and it is called quadratic variation of f (on [a, b]). 


We notice that in Definition 5 the quadratic variation [ f] is continuous on [a, b], 
since f is a continuous function. 


Remark 1 Notice that if f is a fixed Brownian path and g(s) = g(s, f (s)), with 
ge C!({a, b] x R). Then m g(s)d f(s) exists for almost all (with respect to 
the Wiener measure on C ([a, b])) Brownian paths f. This latter result can be shown 
using Theorem 2.1 in [26] (which implies that the deterministic bracket exists, for 
almost all Brownian paths f, and [f](s) = s) and then applying Itó's formula in 
Theorem 1 above, with IP given by the Dirac delta at a Brownian path f. 


We conclude this subsection with an integration by parts formula for the deter- 
ministic forward and backward integrals. 


Proposition 4 Let f: [a,b] — R be a càdlàg function and g: [a,b] —> R bea 
bounded variation function. Then, the following integration by parts formulae hold: 


/ g(s)d f(s) = soro- | f (s)dg(s), (11) 
[a,b] Ja,b] 
f g(s)d* f(s) = gb) fio) -f f(s” )dg(s). (12) 
[a,b] ]a.b] 


Proof Identity (11). The left-hand side of (11) is the limit, when e > 0*, of 


1 sore 1 [b 1 fÈ 1 f4 
a f g(s)f (s + &e)ds — E n g(s) f (s)ds + F f g(s) f (D)ds + A T g(a) f (s + e)ds. 
a —£ a—é 


E€ Ja 


This gives 


1 b 1 b 1 b 1 a 
z[s6-2fous- i] sores lf sors lf gofe+ods 
acte a b—& a—e& 


€ 
b _ = 1 ade 1 b 
J 8(s)—8(s— &) f(s)ds E f g(s) f(s)ds + JI g(s) f (b)ds 
? a b—e 


+e E 


1 a 
+ " / g(a) f (s 4- e)ds. 
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We see that 


£—0* 


1 b 

3 g(s)f(b)ds — g(b )f(b), 
1 4 1 are e sot 
Ji g(a) f (s 4- e)ds — JI g(s)f(s)ds — 0. 


Moreover, we have 


b Dae b 
jes P) f(s)ds = | dsf(s)~ [480 
ate € ate E€ J]s—e,s] 


1 peagrte) eoot 
=- n dgy) | fen € < n YI 
Ja,b] € Jyv(ate) Ja,b[ 


In conclusion, we find 


f g(s)d f(s) 
[a,b] 


-] " dg(y) f(y) + (g(b) — g(b )) f(b) + g(b )f (b) 


-f dg(y) f(y) + gb) f (b). 
Ja,b] 
Identity (12). The left-hand side of (12) is given by the limit, as € > OT, of 


1 b 1 b 1 b 1 b—e& 
- f g(s) f (s)ds — Ji g(s)f(s—s)ds = - ri g(s) f (s)ds — Ji g(s 4- e) f (s)ds 
€ Ja € Jate E Ja € Ja 


b—e& b 
a af” iro 19. 2 
€ & 


a 


g(s) f (s)ds 


b-e 


The second integral on the right-hand side goes to g(b~) f (b7) as e > O*. The first 
integral expression equals 


Fi — g4 a b u 
=) pa a. sæ | gs+edst | f(y $9 9, 
R € E aze jus n 


a sj " f(s” )dg(s) — f(a)g(a) + f (a)g(a) + (g(b) — g(b ) f (b). 


a 


taking into account identity (7). This gives us the result. 


2.2 The Spaces € ([-T, 0]) and €([—T, 0D 


Let C([—T, 0]) denote the set of real continuous functions on [—T, 0], endowed with 
supremum norm ||7)|loo = Supe. 7. o; [n (x), for any n € C([—T, 0]). 
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Remark 2 We shall develop functional Itó calculus via regularization firstly for time- 
independent functionals ~% : C([—T, 0]) — R, since we aim at emphasizing that in 
our framework the time variable and the path play two distinct roles, as emphasized 
in the introduction. This, also, allows us to focus only on the definition of horizontal 
and vertical derivatives. Clearly, everything can be extended in an obvious way to 
the time-dependent case Y : [0, T] x C([-T,0]) — R, as we shall illustrate later. 


Consider a map 4 : C([—T, 0]) — R. Our aim is to derive a functional Itó's for- 
mula for Y . To do this, we are led to define the functional (i.e., horizontal and vertical) 
derivatives for Y in the spirit of [5, 17]. Since the definition of functional derivatives 
necessitates of discontinuous paths, in [5] the idea is to consider functionals defined 
on the space of càdlàg trajectories D([—T, 0]). However, we can not, in general, 
extend in a unique way a functional Y defined on C([—T, 0]) to D({—T, 0]). Our 
idea, instead, is to consider a larger space than C([— T, 0]), denoted by € ([—T, 0]), 
which is the space of bounded trajectories on [— T, 0], continuous on [—7, O[ and 
with possibly a jump at 0. We endow @([—T, 0]) with a (inductive) topology such 
that C([—T, 0]) is dense in @([—T, 0]) with respect to this topology. Therefore, if 
WU is continuous with respect to the topology of € ([—T, 0]), then it admits a unique 
continuous extension u: @([—T, 0]) — R. 


Definition 6 We denote by @([—T, 0]) the set of bounded functions 5: [—T, 0] 
— R such that 7 is continuous on [—T, O[, equipped with the topology we now 
describe. 

Convergence We endow @([—T, 0]) with a topology inducing the following con- 
vergence: (Nn)n converges to n in @([—T, 0]) as n tends to infinity if the following 
holds. 


Q) |Innlloo < C, for any n € N, for some positive constant C independent of n; 
Gi) sup,c |n, (x) — n(x)| — 0 as n tends to infinity, for any compact set K C 
[-T, O[; 


(iii) 94 (0) — (0) as n tends to infinity. 
Topology For each compact K C [— T, O[ define the seminorm px on € ([—T, 0]) by 


pkQDn = sup [nOD nOD, Vn e €([-T, 0]. 
xE 


Let M > 0 and €«([—T,0]) be the set of functions in @([—T,0]) which are 
bounded by M. Still denote px the restriction of px to 6m ([—T, 0]) and consider 
the topology on € ([—T, 0]) induced by the collection of seminorms (px) x. Then, 
we endow @([—T, 0]) with the smallest topology (inductive topology) turning all 
the inclusions iy: @y([—T, 0]) ~ €([—T,0]) into continuous maps. 
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Remark 3 (i) Notice that C([—T, 0]) is dense in &([—T, 0]), when endowed with 
the topology of @([—T, 0]). As a matter of fact, let n € @([—T, 0]) and define, for 
any n € N\{0}, 


n(x), -T <x<-lI/n, 
Qn(x) = 
n(n(0) — n(—1/n))x + (0), —1/n < x <0. 


Then, we see that o, € C([—T, 0]) and gn — n in @([—T, 0]). 
Now, for any a € R define 


Ca([-T, 0) := (n € C(—T, 01): n(0) = a}, 
@a([—T, 0)) := (n € €(—-7,0p: n0) = a}. 


Then, Cg({[—T,0]) is dense in @j([—T,0]) with respect to the topology of 
SAT— T. 0p. 

(ii) We provide two examples of functionals Y : C([—T, 0]) — R, continuous with 
respect to the topology of @([—T, 0]), and necessarily with respect to the topology 
of C([—T, 0]) (the proof is straightforward and not reported): 


(a) Y(n) = gati). .... (5). for all n e C([-T, 0]), with -T € t <--- < 
tn < 0 and g: R” — R continuous. 

(b Y(n) = Jiro o(x)d n(x), for all n € C([-7,0p, with g: [0, T] > Ra 
càdlàg bounded variation function. Concerning this example, keep in mind that, 
using the integration by parts formula, 7/ (7) admits the representation (11). 


(iii) Consider the functional Y (n) = SUP. e[. T. 0] n(x), for all n € C([—T,0]). It 
is obviously continuous, but it is not continuous with respect to the topology of 
€ ([—T, 0]). As a matter of fact, for any n € N consider n, € C([—T, 0]) given by 


0, -T <x < 2n > 
2nl T T 
M(x) = yap x+2, -a < XST 
gntl 
T , n+l <x< 0 


Then, Y (na) = SUD. ep. T. 0] In (x) = 1, for any n. However, nn converges to the zero 
function in 4 ([—T, 0]), as n tends to infinity. This example will play an important 
role in Sect.3 to justify a weaker notion of solution to the path-dependent semilinear 
Kolmogorov equation. 


To define the functional derivatives, we shall need to separate the “past” from the 
"present" of n € @([—T, 0]). Indeed, roughly speaking, the horizontal derivative 
calls in the past values of 7, namely (n(x): x € [—T, O[}, while the vertical derivative 
calls in the present value of 7, namely (0). To this end, it is useful to introduce the 
space @([—T, O[). 
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Definition 7 We denote by @({—T, O[) the set of bounded continuous functions 
y: [-T,0[ — R, equipped with the topology we now describe. 

Convergence We endow @([—T, O[) with a topology inducing the following con- 
vergence: (y4), converges to y in @([—T, O[) as n tends to infinity if: 


(i) supyep r op Yn | < C, for any n € N, for some positive constant C indepen- 
dent of n; 

Gi) supyex lya (x) — y (x)| — 0 as n tends to infinity, for any compact set K C 
[-T, Of. 


Topology For each compact K C [—T, O[ define the seminorm qx on € ([—T, O[) by 
qk(y) = sup Iy 00; Vy € @([—T, OD. 
XE 


Let M > 0 and @y([—T, O[) be the set of functions in @([—T, O[) which are 
bounded by M. Still denote qx the restriction of gx to @y({—T, O[) and consider 
the topology on 6m ([—T, O[) induced by the collection of seminorms (qx) x. Then, 
we endow @([—T, O[) with the smallest topology (inductive topology) turning all 
the inclusions iy: @y([—T, 0D —^ €([—T, Of) into continuous maps. 


Remark 4 (i) Notice that € ([—T, 0]) is isomorphic to @([{—T, O[) x R. As a matter 
of fact, it is enough to consider the map 


J: €([-T,0]) — «€([-T, OD x R 
n e Giro. (0). 


Observe that J7!: v([-T,O[) x R — ([—T,0]) is given by J~!(y,a) = 
Y lp-r.o + alto). 

(ii) € ([—T, 0]) is a space which contains C([—T, 0]) as a dense subset and it has the 
property of separating "past" from "present". Another space having the same property 
is L'([-T, 0]; d u) where u is the sum of the Dirac measure at zero and Lebesgue 
measure. Similarly as for item (i), that space is isomorphic to L?([-T, 0p x R, 
which is a very popular space appearing in the analysis of functional dependent (as 
delay) equations, starting from [4]. 


For every u: @([—T,0]) — R, we can now exploit the space @([—T, O[) to 
define a map à: @([—T, OD x R — R where “past” and "present" are separated. 


Definition 8 Let u: @({[—T, 0]) —> R and define 4: &([—T,O[) x R > Ras 
ü(y,a) = u(ylpropalgjp,  VG5a)eV(—T,OD xR. — (13) 


In particular, we have u(y) = ü(nj—r.or, n(0)), for all n € € ([—T, 0]). 
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We conclude this subsection with a characterization of the dual spaces of 
«([—T,0]) and @({—T, O[), which has an independent interest. Firstly, we need 
to introduce the set Z ([—T, 0]) of finite signed Borel measures on [— T , 0]. We also 
denote ,Zo([—T, 0]) C 4 ([-T, 0]) the set of measures u such that j,((0]) = 0. 


Proposition 5 Let A € @([—T,0])*, the dual space of € ([—T, 0]). Then, there 
exists a unique  € M ([—T,0]) such that 


An — | nG)u(dx), Wn € @((-T,0)). 
[-T.0] 


Proof Let A € &([—T, 0])* and define 


Ag := Ag, | Voc C([-T,0]). 


Notice that A: C ([—T,0]) — R is a continuous functional on the Banach space 
C([—T, 0]) endowed with the supremum norm || - ||oo. Therefore Ae C([-T,0p* 
and it follows from Riesz representation theorem (see, e.g., Theorem 6.19 in [36]) 
that there exists a unique u € JZ ([—T, 0]) such that 


Ag zii eG)u(dx)  VeeC([-T,0]. 
[-T,0] 


Obviously A is also continuous with respect to the topology of @([—T, 0]). Since 
C([—T, 0]) is dense in @([—T, 0]) with respect to the topology of @([—T, 0]), we 
deduce that there exists a unique continuous extension of Ato « ({—T, 0]), which is 
clearly given by 


An = J n(x)u(dx), Yn € €([-T, 01). 
[-T.0] 


Proposition 6 Let A € C ([—T,0D*, the dual space of @({—T, O[). Then, there 
exists a unique p € JKo([—T,0]) such that 


Ay zx y(x)u(dx), Yy e €([-T, OD. 
[-T.0[ 


Proof Let A € &([—T, O[)* and define 


An := A(ny-ro), Vm € €([—T,0]). (14) 
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Notice that A: @({—T, 0]) — Risacontinuous functional on @ ([—T, 0]). It follows 
from Proposition 5 that there exists a unique u € -W([—T, 0]) such that 


An = f n(x)u(dx) = f n(x)u(dx) + nOu d0), Yn € €(—T,0]). 
[-T,0] [-T,0[ 
(15) 
Let nı, n2 € VIT, 0]) be such that nı 1j-7,o[ = 72 1,—7,0;. Then, we see from (14) 
that An; = Aro, which in turn implies from (15) that u({0}) = 0. In conclusion, 
u € M([—T, 0]) and A is given by 


Ay = f y(x)u(dx), Yy € €([-T, OD. 
[-T.0[ 


2.3 Functional Derivatives and Functional It6’s Formula 


In the present section we shall prove one of the main result of this section, namely 
the functional Itó's formula for 7: C([-T,0]) — R and, more generally, for 
7: [0, T] x C(—7,0]) — R. We begin introducing the functional derivatives, 
firstly for a functional u: @([—T, 0]) — R, and then for Z: C([-T,0]) > R. 


Definition 9 Consider u: @([—T, 0]) — R and y e €([—T, 0]). 
(i) We say that u admits the horizontal derivative at 7 if the following limit exists 
and it is finite: 


s u(C)lp-r,or + 70) lto) — uN — €) 1-70, + nO o) 
pi € 


Di u(y) := li 

E> 0 

(16) 

(iY Let à be as in (13), then we say that uw admits the horizontal derivative at 
(y, a) € €([—T, OD x R if the following limit exists and it is finite: 


ü(yC),a) — ü(y C — €), a) 
€ 


D'ü(y,a):— lim (17) 
g—0* 


Notice that if DP um) exists then D ü (np rot. 1(0)) exists and they are equal; 
viceversa, if Diy, a) exists then DP u (y 1j r,op + alqo) exists and they are equal. 
(ii) We say that u admits the first-order vertical derivative at 7 if the first-order 


partial derivative 95 4 (N-ro; n(0)) at (N-ro, n(0)) of ù with respect to its second 
argument exists and we set 


DY u(y) := 94üm-r.op n()). 
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(iii) We say that u admits the second-order vertical derivative at 7 if the second- 
order partial derivative at (7\(~7,0,, n(0)) of u with respect to its second argument, 
denoted by 92 QT. op 1(0)), exists and we set 


DY’ u(y) :— 32, ü(mp-r. ot, n(0)). 


Definition 10 We say that u: @({—T, 0]) > R is of class €? (past x present) if 
(i) u is continuous; 
(ii) Du exists everywhere on @([—T, 0]) and for every y € @([—T, O[) the map 
(e,a) — DP'ü(y(-—52),a),  (&,a) € [0, oo[ x R 


is continuous on [0, co[ xR; 
(iii) DY u and DY u exist everywhere on @([—T, 0]) and are continuous. 


Remark 5 Notice that in Definition 10 we still obtain the same class of functions 
E1? (past x present) if we substitute point (ii) with 


(i) Du exists everywhere on € ([—T, 0]) and for every y € @([—T, O[) there 
exists 6(y) > 0 such that the map 


(e,a) — D'ü(y(-—62),a), | (ea) € [0, co[xR (18) 


is continuous on [0, ó(y)[xIR. 


In particular, if (ii') holds then we can always take ó(y) = oo for any y € 
€ ([—T, OD, which implies (ii). To prove this last statement, let us proceed by con- 
tradiction assuming that 


ay) = sup {6(y) > 0: the map (17) is continuous on [0, d(y)[xR} « oo. 
Notice that ó*(y) is in fact a max, therefore the map (18) is continuous on 
[0, 8* (y) [xIR. Now, define y(-) :— y C — ó*(y)). Then, by condition (ii^) there 
exists 6(y) > 0 such that the map 


(e,a) — DPü(y( — £), a) = DP ùy — e — 8*(y)), a) 


is continuous on [0, 6(7)[xR. This shows that the map (18) is continuous on 
[0, 8* (y) + 6(y)[ XR, a contradiction with the definition of 8* (y). 


We can now provide the definition of functional derivatives fora map Y : C([—T, 0]) 
— R. 


Definition 11 Let Y: C([-T,0]) — R and n € C([—T, 0]). Suppose that there 
exists a unique extension u: @([—T, 0]) > Rof Y (e.g., if Y is continuous with 
respect to the topology of  ([—T, 0])). Then we define the following concepts. 
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(i) The horizontal derivative of 7/ at n as: 
D UM) = DM u(y). 
(ii) The first-order vertical derivative of 7/ at 7 as: 
DYU (n) := D'uG). 
(iii) The second-order vertical derivative of Y at 7 as: 
DYUM) = D''u(p. 


Definition 12 We say that Z/ : C(.—T,0]) — R is Cl? (past x present) if Y admits 
a (necessarily unique) extension u: @([—T, 0]) — R of class €! (past x present). 


Theorem 2 Let Y: C([-T,0]) — R be of class C? (past x present) and 
X = (Xp)iepo,r] be a real continuous finite quadratic variation process. Then, the 
following functional Itó's formula holds, P-a.s., 


t t t 
UK) = Uo) + f Dia cds « | DYU (X,)d- X, + >| DYV 4 (X, [X], 
0 0 0 


(19) 
for all 0 € t < T, where the window process X was defined in (2). 


Proof Fix t € [0, T] and consider the quantity 


Io(e,t) = [ K NN ru U (X,)ds — JI d/(X,)ds, 820. 
0 E E Jt € Jo 

Since the process (77 (X,))s>0 is continuous, /o(e, t) converges ucp to Y (X;) — 

U (Xo), namely supo, 7 ole, t) — (Y (Xr) — Y (Xo))| converges to zero in prob- 

ability when e — 0* . On the other hand, we can write J (e, t) in terms of the function 

ii, defined in (13), as follows 


ds. 


Wene f ü(Xs+el[-T,0D Tun — U(Xs[-7, 0, Xs) 
0 


Now we split /o(e, t) into the sum of two terms 


ies [ ü(Xs+ell-T,0 Xs+e) — U(Xsip—7, 0, Xste) ge (20) 
0 


€ 
Xr. ,X — ü(X Sr T. or, X. 
bens f CKsi[— T, 0 uo) CXsi- Top; Xs) 
0 


ds. (21) 
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We begin proving that 


t 
discs f D™Y(X,)ds. (22) 
e>0t Jo 
Firstly, fix y € € ([-T, O[) and define 
$(se,a) := u(y(-— &),4), (e, a) € [0, oo[ xR. 


Then, denoting by 3$ ¢ the right partial derivative of ¢ with respect to £ and using 
formula (17), we find 


ole +1, a) — $ (e, a) 
m 


af o(e,a) = li 


r>0t r 
FM u(y(.— €),@) — á(yC—e—r).a) 
= im 

r—0* r 


—D'ü(y(.—s),a)  V(s,a) € [0, oo[ xR. 


Since u € €! (past x present), we see from Definition 10(11), that oF ¢ is continuous 
on [0, co[ xR. It follows from a standard differential calculus result (see for example 
Corollary 1.2, Chap. 2, in [32]) that @ is continuously differentiable on [0, co[xR 
with respect to its first argument. Then, for every (e,a) € [0, co[xR, from the 
fundamental theorem of calculus, we have 


o(e,a) — $(0,a) = i Jeb (r, a)dr, 
0 
which in terms of ii reads 
ü(yC),a)-üu(yC—8).a) = n DP'ü(y(. — r), a)dr. (23) 


Now, we rewrite, by means of a shift in time, the term 7; (e, t) in (20) as follows: 


c n ü Xs op Xs) — 8CX eror Xs) is 
0 € 
tte WORT or, Xv) — BOR er- T.0p X 
«f üCXsg—T,op, Xs) — 4 CXs— e T,0[ s) qs 
t € 
A uü(Xsg-r,op, Xs) — u (Xs-—el[-T,0 Xs) 
0 


ds. (24) 
E 


Plugging (23) into (24), setting y = X;,a = Xs, we obtain 
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t 1 E : 
I(e,t) = f (f DP ù(Xs-r[-T,0b Xodr us 
0 € 0 
t+é 1 E 
+4 (J DP iX, rar. Xodr as 
f 0 


€ 1 & 5 
-f (J D/'iQX,. rap X our as. (25) 
0 € 0 
Observe that 


ty E t 
[+ DM HC jg, Xdr )ds = ri D u(X,)ds. 
0 € 0 e>0t Jo 


Similarly, we see that the other two terms in (25) converge ucp to zero. As a conse- 
quence, we get (22). 

Regarding 7»(e, t) in (21), it can be written, by means of the following standard 
Taylor's expansion for a function f € C?(IR): 


1 
O= fla) + f'ab a) + 5f OO- ay 
1 
$ n (1 — a)(/" (a c a(b — a) — f" (a) (6 — afda, 
0 


as the sum of the following three terms: 


Xste = Xs 


t 
Ii(e, t) = ri ða CXsi m op, Xs) ds 
0 


€ 


(Xste n X9^ e 


1 f! 7 
Ing (€,t) = F 92 ü (Xsl-T,0 Xs) P 
0 


t 1 
ba(s, f) = 1 (/ (A — o) (82, Gs op Xs + «(X spe — Xs)) 
0 0 
dre 
— 92 ü (Xt rop Xs)) Ce A dads 


By similar arguments as in Proposition 1.2 of [39], we have 


1 


t E 1 t 
In(e,t) —> = f 92 ü (Xr oi, X)d[X]; = = n D'VuptogiX]s 
e >0t 2 Jo 2 Jo 


Regarding /23(e, t), for every w € £2, define y: [0, T] x [0, 1] x [0, 1] — Ras 


Vo, a, £) := (1 — 0)82,  (Xsip crop (9); Xs (9) + a(Xs4e(@) — Xs(@))), 
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for all (s, à, €) € [0, T] x [0, 1] x [0, 1]. Notice that Wa is uniformly continuous. 
Denote py,, its continuity modulus, then 


T 2 
X —X 
sup |h3(e,t)| < n p Xu sta ™ Xe) ae 
te[0,T] 0 € 


Since X has finite quadratic variation, we deduce that /23(e, t) > O0 ucp as e > OF. 


Finally, because of Jo(e, t), Iı (e, t), I2» (e, t), and I23(e, t) converge ucp, it follows 
that the forward integral exists: 


t t 
ucp " = = 
hile, t) —> f dal (Xs. rop, Xs)d X, = n D'u(X,)d- X,;, 
e>0t Jo 0 


from which the claim follows. 


Remark 6 Notice that, under the hypotheses of Theorem 2, the forward integral 
IA DV Y (X,)d- X; exists as a ucp limit, which is generally not required. 


Remark 7 The definition of horizontal derivative. Notice that our definition of hor- 
izontal derivative differs from that introduced in [17], since it is based on a limit on 
the left, while the definition proposed in [17] would conduct to the formula 


D to ccn u(nC + &)lu-r op n(0)) — (nC) l-r, n) 


e—>0+t E 


(26) 


To give an insight into the difference between (16) and (26), let us consider 
a real continuous finite quadratic variation process X with associated window 
process X. Then, in the definition (26) of D'.*u(X,) we consider the incre- 
ment Z (X;-r,o[C + £), Xi) — CX, op; Xt), comparing the present value of 
u(X;) = ü(Xip-r,op, Xr) with an hypothetical future value 4 (X; rop + £), X+), 
obtained assuming a constant time evolution for X. On the other hand, in our defi- 
nition (16) we consider the increment i (X; oj, Xt) — 4X; er, op, Xt), where 
only the present and past values of X are taken into account, and where we also 
extend in a constant way the trajectory of X before time 0. In particular, unlike (26), 
since we do not call in the future in our formula (16), we do not have to specify 
a future time evolution for X, but only a past evolution before time 0. This differ- 
ence between (16) and (26) is crucial for the proof of the functional Itó's formula. 
In particular, the adoption of (26) as definition for the horizontal derivative would 
require an additional regularity condition on u in order to prove an Itó formula for 
the process f +» u(X;). Indeed, as it can be seen from the proof of Theorem 2, to 
prove Itó's formula we are led to consider the term 


ned ri ü(Xs+el[-T,0[ Xste) — ü(XsI[-T,0[ Xste) J. 
0 € 
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When adopting definition (26) it is convenient to write Jı (e, t) as the sum of the two 
integrals 


In.) = i IOS ce Top Xst¢e) — ROS To C + £), Xste) s. 
0 € 


Ii»(6, t) — [ üCXsi- ToC + £), Xs+e) = i CXs[— T, of. Xste) s. 
0 E 


It can be shown quite easily that, under suitable regularity conditions on u (more 
precisely, if u is continuous, D® +u exists everywhere on @([—T, 0]), and for every 
y € €([—-T, OD the map (e, a) — DP-*ü(y (-+ £), a) is continuous on [0, co) x 
R[, we have 


t 
sitet). o5 f DH w(X,)ds. 
e>0t Jo 


To conclude the proof of Itó's formula along the same lines as in Theorem 2, we 
should prove 


Inet) 6: (27) 
g—0* 


In order to guarantee (27), we need to impose some additional regularity condition 
on ii, and hence on u. As an example, (27) is satisfied if we assume the following 
condition on uw: there exists a constant C > 0 such that, for every € > 0, 


lä(yı, a) — u(y2,a)| x Ce sup |y Œ) — yœ), 


xe[—e,0[ 


for all y1, yo € €([—T, 0D and a € R, with yj (x) = y2(x) for any x € [- T, —e]. 
This last condition is verified if, for example, i is uniformly Lipschitz continuous 
with respect to the L '({[—T, 0])-norm on €'([—T, Of), namely: there exists a constant 
C > O such that 


li(1,4) — &(yo, a)] < ef 


Da Go) — y2(x)|dx, 
of 


for all yj, y2 € @({[—T, O[) and a € R. 


We conclude this subsection providing the functional Itó's formula for a map 
U : [0, T] x C([- T, 0]) — R depending also on the time variable. Firstly, we notice 
that for a map X : [0, T] x C([-T,0]) > R (resp. u: [0, T] x €([-T,0]) > R) 
the functional derivatives DP Y, DV Y, and DV Y (resp. DP u, DV u, and DV u) 
are defined in an obvious way as in Definition 1 1 (resp. Definition 9). Moreover, given 
u: [0, T] x @({[—T, 0]) — R we can define, as in Definition 8, a map i: [0, T] x 
«€ ([—T, 0D x R — R. Then, we can give the following definitions. 
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Definition 13 Let / be [0, T [ or [0, T]. We say that u: I x @([—T, 0]) > R is of 
class ?((I x past) x present) if the properties below hold. 


(i) u is continuous; 
(ii) 3u exists everywhere on J x @([—T, 0]) and is continuous; 
(iii) Du exists everywhere on J x @([—T, 0]) and for every y € @([—T, O[) the 
map 


(t,,a) — D'ü(t,y—62),a, — (t,e,a) € I x [0, oo[ xR 


is continuous on J x [0, oo[ xR; 
(iv) DV u and DV” u exist everywhere on J x @([—T, 0]) and are continuous. 


Definition 14 Let / be [0, T[ or [0, T]. We say that Z/: I x C([—T,0]) — Ris 
ch x past) x present) if % admits a (necessarily unique) extension u: J x 
«€ ([—T,0]) > R of class @!7((I x past) x present). 


We can now state the functional Itó's formula, whose proof is not reported, since 
it can be done along the same lines as Theorem 2. 


Theorem 3 Let 4 : [0, T] x C([-T,0]) > R be of class C^?(([0, T] x past) x 
present) and X = (Xy)iepo,r] be a real continuous finite quadratic variation process. 
Then, the following functional Itó's formula holds, P-a.s., 


t t 
U(t, XÀ = UO, xo+ | (81% (5.5) + DEW (5, %s))ds-+ f DY X(s,Xs)d7 X; 
0 0 
t 
+ ;] DVV 4y (s, Xy)d[X];, (28) 
0 


for all 0 € t € T. 


Remark 8 Notice that, as a particular case, choosing Y(t, n) = F (t, n(0)), for any 
(t, n) € [0, T] x C({—T, 0], with F € C'?([0, T] x R), we retrieve the classical Itó 
formula for finite quadratic variation processes, i.e. (4). More precisely, in this case 
7/ admits as unique continuous extension the map u: [0, T] x €([-T,0]) —> R 
given by u(t, n) = F(t, n(0)), for all (t, n) € [0, T] x €([—T, 0]). Moreover, we 
see that D?Y = 0, while DYXY = 3 F and DYYY = 82 F, where 0, F (resp. 
a? ,, F) denotes the first-order (resp. second-order) partial derivative of F with respect 
to its second argument. 


2.4 Comparison with Banach Space Valued Calculus 
via Regularization 


In the present subsection our aim is to make a link between functional It6 calculus, as 
derived in this paper, and Banach space valued stochastic calculus via regularization 
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for window processes, which has been conceived in [13], see also [12, 14—16] for 
more recent developments. More precisely, our purpose is to identify the building 
blocks of our functional Itó's formula (19) with the terms appearing in the Itó formula 
derived in Theorem 6.3 and Sect. 7.2 in [12]. While it is expected that the vertical 
derivative DY Y can be identified with the term p» U of the Fréchet derivative, it is 
more difficult to guess to which terms the horizontal derivative D" Y corresponds. 
To clarify this latter point, in this subsection we derive two formulae which express 
DP Y in terms of Fréchet derivatives of Y. 

Let us introduce some useful notations. We denote by BV([—T, 0]) the set of 
càdlàg bounded variation functions on [—T, 0], which is a Banach space when 
equipped with the norm 


Ilnllavqor.op = In] + lImllvaq-r.op. 1 € BV(—T,0p. 


where ||n|lvarq-T,op = Idn|([—T, 0]) and |dn] is the total variation measure associ- 
ated to the measure dn € .@([—T, 0]) generated by y: dn(] — T, —1]) = n(-t) — 
n(—T), t € [—T, 0]. We recall from Sect.2.1 that we extend n € BV([—T,0]) to 
all x € R setting n(x) = 0, x < —T, and n(x) = n(0), x > 0. Let us now introduce 
some useful facts about tensor products of Banach spaces. 


Definition 15 Let (E, || - || p) and (F, || - || r) be two Banach spaces. 

(i) We shall denote by E & F the algebraic tensor product of E and F, defined as 
the set of elements of the form v = $77 , ej Q fi, for some positive integer n, where 
e € E and f € F. The map 8: Ex F > EG F is bilinear. 

(ii) We endow E ® F with the projective norm zr: 


n 


ro) := int | 3 letele : Xe]. Yve EQF. 


i=l i=l 


(iii) We denote by EG F the Banach space obtained as the completion of E & F for 
the norm zr. We shall refer to EG); F as the tensor product of the Banach spaces 
E and F. 

(iv) If E and F are Hilbert spaces, we denote EG F the Hilbert tensor product, 
whichis still a Hilbert space obtained as the completion of EG F for the scalar product 
le & f', e" & f") := (e,e")g(f', f")p,forany e',e" e E and f', f" € F. 

(v) The symbols E M and eG? denote, respectively, the Banach space EG E and 
the element e & e of the algebraic tensor product E & E. 


Remark 9 (i) The projective norm z belongs to the class of the so-called reasonable 
crossnorms a on E & F, verifying a (e & f) = ell zl flr. 

(ii) We notice, proceeding for example as in [16] (see, in particular, formula (2.1) in 
[16]; for more information on this subject we refer to [41]), that the dual (E @, F)* of 
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EG, F is isomorphic to the space of continuous bilinear forms Zi (E, F), equipped 
with the norm || - || z, 7 defined as 


IIE, F :— sup |G, f), V9 € Zi(E, F). 
ecE,feF 
lele. flet 


Moreover, there exists a canonical isomorphism between Zi(E, F) and L(E, F*), 
the space of bounded linear operators from E into F*. Hence, we have the following 
chain of canonical identifications: (EG F)* = Bi(E, F) = L(E; F*). 


Definition 16 Let E be a Banach space. We say that Y : E — R is of class C? (E) 
if 


(i) DY, the first Fréchet derivative of Y, belongs to C(E; E*) and 
(ii) D? Z, the second Fréchet derivative of Y, belongs to C(E; L(E; E*)). 


Remark 10 Take E = C([—T, 0]) in Definition 16. 
(i) First Fréchet derivative DW. We have 


DY: C([-T, 0] > (C([-7,0])* X . Ip ([—-T, 0]. 


For every n € C([-T,0], we shall denote Dax% (5) the unique measure in 
MM ([—T, 0]) such that 


D^ (pe = f 


" eQ)Dax (p, | Ve e C([-T,. Op. 


Notice that -@([—T, 0]) can be represented as the direct sum: JZ([—-T,0]) = 
"Ig([—T, 0] 9 A, where we recall that Zo ([—T, 0]) is the subset of. ([—T, 0]) of 
measures u such that jL((0]) = 0, instead Yo (which is a shorthand for Zo ([— T, 0])) 
denotes the one-dimensional space of measures which are multiples of the Dirac mea- 
sure ôo. For every n € C([—T,0]) we denote by (DEU (n), DÈ M (n) the unique 
pair in .4([—T, 0]) & Y such that 


Dax%(n) = Dj; ^r M) + DRY D. 
(ii) Second Fréchet derivative DY. We have 


D?4: C([-T. 0] — L(C(I-T, 0); (C(— T, 0D)*) = Bi(C(-T, 0), CM-T, 0D) 
= (C(-T,0)8;s C(—T, 0))*, 


where we used the identifications of Remark 9(ii). Let n € C([—T,0]); a typical 
situation arises when there exists Dax ay Y (n) e AJ ([—T, 0]?) such that D?Y (n) € 
L(C([-T, 0); (C([—T, 0]))*) admits the representation 
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DUNG. Y) = J op 100 O Dasay U), Vo, V € C(-T, 0]. 


Moreover, Dax dy Y (n) is uniquely determined. 


The definition below was given in [13]. 


Definition 17 Let E bea Banach space. A Banach subspace (x, ||: ||; ) continuously 
injected into (EQ y, i.e., ||- lly = dl 


(EQ? )*). 


Remark 11 Take E — C([—T,0]) in Definition 17. As indicated in [13], a typi- 


cal example of Chi-subspace of C([—T, op&? is JL ([- T, 0]*) equipped with the 
usual total variation norm, denoted by || - || va. Another important Chi-subspace of 


C([—T, 0] a. is the following, which is also a Chi-subspace of .@([—T, 01°): 


T will be called a Chi-subspace (of 


xo:— {ue M-T, 012): (dx, dy) = g1 (x, y)dxdy + 180(dx) & óo(dy) 
+ 22(x)dx Q X56o(dy) + 4369(dx) & g3(y)dy + 24(x)8y (dx) & dy, 
£i € L” (I-T, 0P), 82, 93 € L'(I- T, Ol), g4 € L (I-T, 0), 11,32, 43 € R}. 


Using the notations of Example 3.4 and Remark 3.5 in [16], to which we refer for 
more details on this subject, we notice that xo is indeed given by the direct sum xo — 
L*([-T,0P) e (L°{-T, 0) @n%) © (Zo&&L^(—-T, 0) e oo(—T, OP) © 
Diag([—T, 0]?). In the sequel, we shall refer to the term g4(x)8y (dx) & dy as the 
diagonal component and to g4(x) as the diagonal element of ju. 


We can now state our first representation result for D" Y. 


Proposition 7 Let Y: C([-T,0]) — R be continuously Fréchet differentiable. 
Suppose the following. 


(i) For any n € C([—-T,0]) there exists DEY (n) € BV([—T,0] such that 
Dl (y = DEU (nx. 
(ii) There exist continuous extensions (necessarily unique) 


u: €([-T,0] > R, D“u: &([-T,0] ~ BV([-T,0] 


of Y and DEU, respectively. 
Then, for any n € C([-T,0]), 


DUM) = J 


" DEU Md (x), (29) 
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where we recall that the previous deterministic integral has been defined in Sect. 
2.1.1. In particular, the horizontal derivative DP Y (n) and the backward integral 
in (29) exist. 


Proof Let n € C([—T,0]) then starting from the left-hand side of (29), using 
the definition of DP Y (7), we are led to consider the following increment for the 
function u: 


u(n) — uC — &e)lp-r. op + n(0)1(9)) 
€ 


(30) 


We shall expand (30) using a Taylor's formula. Firstly, notice that, since Y is C! 
Fréchet on C([—T, 0]), for every nı € C([—T, 0]), with nı(0) = 5(0), from the 
fundamental theorem of calculus we have 


1 0 
4 (Q) — V (n) = 1 (J Pevar -1O = menda. 


Recalling from Remark 3 the density of C0) ([—T, 0]) in 60) ([L—T, 0]) with respect 
to the topology of € ([—T, 0]), we deduce the following Taylor's formula for u: 


1 0 
u(n) —u(m) = Í (f, Diu(n + Xi — 9)(G) — ned d. (31) 


for all 4 € G0) ([—T, 0]). As a matter of fact, for any 6 €]0, T /2] let (similarly to 
Remark 3(1)) 


ni(x), -T < x < —ó, 
71,5 (X) = 1 

z110) — mC)x +m (0), —d <x <0 
and 71,9 :— nı. Then gj, € C([—T,0]), for any 6 €]0, 7/2], and n1,5 — mi in 
@ ({—T, 0), asó — 0*. Now, define f : [2 T, 0] x [0, 1] x [O, 7/2] — Ras follows 


f(x, 4,6) = D*u(n 4 A(m,s — GG) — mas), 


for all (x, à, ô) € [-T,0] x [0, 1] x [0, 7/2]. Now (å, ô) BH n + à (nı, — n), is 
continuous. Taking into account that Du: @([—T, 0]) — BV([—T,0]) is contin- 
uous, hence bounded on compact sets, it follows that f is bounded. Then, it follows 
from Lebesgue dominated convergence theorem that 
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1 0 
[Cf omms mao - macodx)ar 
0 =f 


- [f ront [fron 


1 
= n (m D¥u(n + A(m — 2)(QG) — n 


from which we deduce (31), since 2/(y1,5) — u(n1) as 6 > OF. Taking g1() = 
ne — £)lp-T,0[ + n (0) lio}, we obtain 


u(n) — u(n(- — &)lp-r.op + n(0)l{0}) 
€ 


1 0 — MN 
| ( n Dula taae- 8) - n) noro) = Max )an 


€ 
Kh, €) + D (9, €) * By, €), 


where 


1 0 1 
h(n, £) := n (/ n(x) =(D¥u(n (nC — €) — 1C))lt- r,or) 
0 -T E 


~ De stig baie = £) = n() 1-0) dx an 


1 1 0 
bs) - f ( / n(x) D* pu u(n X = 9) = 109) tras). 


=E 


1 -T 
(e): -> f ( J n(x) Du u(n Male = 2) =n) ran) ad. 


—-T-—& 


Notice that, since n(x) = 0 for x < —T, we see that /5(9, €) = 0. Moreover 
D*¥u(-) = Dyu().forx > 0, andn+A(n(-—e€) —n(-)) l-r, > nin @([—T, 0]) 
ase — O*. Since D'*y is continuous from 4 ([—T, 0]) into BV ([—T, 0]), we have 
Dun 4- nC — £) — nC)lp-r,oD > Difu(y) as e > OF. Then 


1 0 
L nax) DS .u(n +A (nC — £) — nC)) 1-701) dx 


1 j ac e>0r ac 
x zy n(x)dx Diyu(n + (nC — £) — nC) l-r) —> nDun). 


=g 


So h(n, e) > n(0) Doy u(n). Finally, concerning /; (7, £), from Fubini's theorem we 
obtain (denoting e, := n -- A(gC — €) — n(-))li-r,o0p 
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54 
Eg pt la. 
ined = | (f noz (ptm - D&.ulnea))ae)adr 
0 =f 
1 0 1 
= -f (/ T Dus) )dx)aa 
0 -T € ]x,x+e] ? 
1 1 OAy 
= -f (/ (J ndx) Diss) Ja 
0 ]-T.s] € (-T)v(y—&) 
= (n, €) + ho, £), 
where 
: 1 oy ac ac 
maa=- fff meres) (Rua - Digu) Ja 
1 1 OAy . 
2m, €) :— -f (/ : EU T" nenas ngu Jan 
—T,& —T)v(y—-e& 


pr Wwe 
= -(f (J vods) Din. 
]-T,e] € XJ (- T)v(y—6) ` 


Recalling that D*“u(-) = Diu), for x > 0, we see that in 711 (n, €) and I15(, €) 
the integrals on ]—T, £] are equal to the same integrals on ]—T, 0], i.e., 


: 1 ud ac ac 
Nid, £) = -f (J -(/ (oos) Dauner) — Dayu (n) E 
0 NJ1-T,0] € VJ(-T)v(y—6) ( MOM dy ) 
l 1 » ac ac 
E = vods) Dj, ule) — Dayu) o 
i Cee ( = dy ) 
PEL 
129, €) = -f : vods) Dit) 
]-T,0] € XJ (-T)v(y—&) . 


1 y : 
= =) no -(/ ndx) DW, 
-T, y-e 


Now, observe that 


: ot 
14, 8)| < [lnllool|D™ ue) — Du) Ilvarq—7,0]) Z o. 
Ium, D) < Wnllooll D uNe) — Dum) 


Moreover, since 7 is continuous at y €]—T, 0], we deduce that f m n(x)dx/e — 
n(y) as  — 07. Therefore, by Lebesgue's dominated convergence theorem, we get 


" 
I2, €) pesi -f 


n(y) Diu). 
]- T.0] i 
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So A(n, €) > — Jr 0] nO) Daun). In conclusion, we have 


DP4 (y = wODifv) - | 


n) Daun): 
0] 


Notice that we can suppose, without loss of generality, DEY (n) = DEU (n). 
Then, the above identity gives (29) using the integration by parts formula (12). 


For our second representation result of DP Z/ we need the following generaliza- 
tion of the deterministic backward integral when the integrand is a measure. 


Definition 18 Leta < bbetworeals. Let f : [a,b] — R be a càdlàg function (resp. 
càdlàg function with f(a) = 0) and g € .@([—T, 0]). Suppose that the following limit 


I aaro in a O a 

[a,b] 07 J[a,b] E 

(resp. f g(ds)d- f(s):— lim gay H8 3 9 7 392) (33) 
[a,b] £—0* J[a,b] € 


exists and it is finite. Then, the obtained quantity is denoted by Jr abi gd* f 
( Jia, bj gd f) and called (deterministic, definite) backward (resp. forward) inte- 
gral of g with respect to f (on [a, b]). 


Proposition 8 Jf g is absolutely continuous with density being càdlàg (still denoted 
with g) then Definition 18 is compatible with the one in Definition 4. 


Proof Suppose that g(ds) — g(s)ds with g càdlàg. 
Identity (32). The right-hand side of (6) gives 


b 
J go P oa 


which is also the right-hand side of (32) in that case. 
Identity (33). The right-hand side of (5) gives, since f(a) = 0, 


a b = B " 
tg f fe +ods+ | go E t? fro. 


The first integral goes to zero. The second one equals the right-hand side of 
(33). 
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Proposition 9 Let y € C([—T,0]) be such that the quadratic variation on [—T , 0] 
exists. Let Y : C([-T,0]) — R be twice continuously Fréchet differentiable such 
that 


Dd :C([-T,0) — xoc (C([-7,00&, C([—T, 0))* continuously with respect to xo. 


Let us also suppose the following. 


(i) p2- Dias y(n), the diagonal element of the second-order derivative at n, has a 
set of discontinuity which has null measure with respect to [n] (in particular, if 
it is countable). 

(ii) There exist continuous extensions (necessarily unique): 


u: €([-T,0] > R, Dix au: EU-T, 0]) > Xo 


of Y and D. ay 4 , respectively. 
(iii) The horizontal derivative DP Y (n) exists at n € C([—T, 0]). 


Then 


1 2,Di 
Dr = / DRU ()d* n) = 5 / D; "4 (dli). G4) 
[-T,0] [-T,0] 

In particular, the backward integral in (34) exists. 


Proof Let y € C([—T, 0]). Using the definition of D 4y (n) we are led to consider 
the following increment for the function u: 


u(n) — u(nC — &e)lt-r.or + n(0)10)) 
F , 


(35) 


with £ > 0. Our aim is to expand (35) using a Taylor's formula. To this end, since 
UY is C? Fréchet, we begin noting that for every nı € C([—7,0]) the following 
standard Taylor's expansion holds: 


d) = w+ [ 


Dax 4 (n)(m (x) — n) 
[-T.0] 


1 
F a Die ay% QD (mo) — n(x)) (m6) — nO») 
[-T,0} l 


2 $ 
1 
+f a-»(f (D3, 4,200 An — 0» 
0 [-T,0]2 


= Dj, 4^ 0) (m) — nw) (m6) — 10))o. 
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Now, using the density of Cy) ([—T, 0]) into @,(0)([—T, 0]) with respect to the 
topology of € ([—T, 0]) and proceeding as in the proof of Proposition 7, we deduce 
the following Taylor's formula for u: 


u(n) — u(C — &e)lp-r,op + n(0)1{0}) 


: (36) 
T, pi vo? 16 =) 
[-T,0] £ 


1 vr? = — — 
zd e" DÀ, gr y (ae) = ne = 6) = nO 2 E 


2 € 


xr (1— (J, id (Di, ayt + ACN — £) — nO) HL roD 
) (n(x) — n(x — 900») — n — €)) 


£ 


em. ay V QD) Ir-To[x [- T,0[ GC. »)o. 


Recalling the definition of xo given in Remark 11, we notice that (due to the presence 
of the indicator function 1[..,o0[x[—T,0[) 


/ D2, gy K) (n(x) — n(x — e) Qm) — nO — €)) «isset duy 


€ 


l D2 LUN 069 7 n& EN) = HO — s» 
[-T.0? 


€ 


" | prPiag gy (yy (100 — Ne = 8) 4 
[-T,0] i E 


dxdy 


where, by hypothesis, the maps n € @([—T, 0]) tb De u(n) € L?({—T, 0]?) and 


n € @((-T, 0) > D? Dias (gy € L**([-T, 0]) are continuous. In particular, (36) 
becomes 
— u(nC = a)l- 0)1 
u(n) — u(n( D rop nO) — h(e) + hte) + Ble) + Ia(&) + Is(e), 
(37) 
where 
Ih) = f piy Oe 
[+7,0] $ 
In(e) := -; | D? LUN) (n(x) — n(x — (0) — ny — D aay. 
2 J[-r.op z 


j = = 2 
ccm i J - po Pia, (gy I) me D is 
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1 
ne - f a-»(f (DEP utn MC — ©) — nO) lap 
0 [-T.0]? 
7 pila) (n(x) — n(x — 00) 10-79 y. ay) an, 
1 , 
Is(e) = — n a-»( / (Dè "uy + XC = 8) = nOr 
0 [-T,0] 


dA. 


; " EDS 
u D n) (n(x) ve é)) dx) 
Firstly, we shall prove that 
sot 
bey => 9. (38) 


To this end, for every € > 0, we define the operator T; : L^([-T,0]) — Ras 
follows: 


gas f zo (n(x) — n(x — 2 — oye Ð) xdi: vg e L-T, 0]. 
[-7,0]? 


Then T; € L?({—T, 0])*. Indeed, from Cauchy-Schwarz inequality, 


— = 2 — = 2 
|Tz 2| = Istar f -— (n(x) n(x €)) (n(y) ny £)) dxdy 
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(n(x) — n(x — 8)? 
lg llz2q—r7,012) dx 
[-T,0] E 


^ 


and the latter quantity is bounded with respect to £ since the quadratic variation of 
n on [—T, 0] exists. In particular, we have proved that for every g € L*u-rop) 
there exists a constant M, > 0 such that 


sup |T g| < Mg. 


0<e<1 


It follows from Banach-Steinhaus theorem that there exists a constant M > 0 such 
that 


sup |ITelz2q-r,op* < M. (39) 


0<e<1 


Now, let us consider the set Z := {g € L?([-T, 012): g(x, y) = e(x) f(y), with 
e, f € C' ([-T, 0])}, which is dense in L?({—T, 0]?). Let us show that 


g—0* 


Teg — 0, VWge-. (40) 
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Fix g € F, with g(x, y) = e(x) f(y) for any (x, y) € [- T, 0], then 
1 
Teg = 3 e(x) (n(x) — nox-e))dx f f(y)(nQ) - nQ-e))dy. (41) 
€ J[-T,0] [-T,0] 
We have 


n e(x)(n(x) — nw — d 


-| io (e(x) —e(x+ €))n(x)dx 


— f e(x + e)n(x)dx + | e(x + e)n(x)dx 
[-T—e,—T] [-e,0] 


< «(f léGoldx + elas) ntis 
[-T,0] 


Similarly, 


|f FO n) — nO — €))dy 
[-T,0] 


< (/ foit» +21 fo) blo 
[-T.0] 


Therefore, from (41) we find 


na < (f cotes + 2e) ( f Potty + 21. f lo int. 
[-T,0] [-T,0] 


which converges to zero as € goes to zero and therefore (40) is established. This in 
turn implies that 


: 
Tg S 0, vee L2(-T,0)). (42) 


Indeed, fix g € L^([-T, OP) and let (gn)n C Z be such that g, — g in 
L?([—-T, 0]”). Then 


ITeg| < |Te(g — 8n)| + ITe gn] < WTellz2q—7,0)2)*ll8 — 2nlr2q-r. op) + 17e 8al. 


From (39) it follows that 


ITeg| < M\lg — 8nllz2q—7,02) + |Ze nl. 


which implies lim sup,_,9+ |Te g| < M||8 — gnllz2q—7,0)2)- Sending n to infinity, we 
deduce (42) and finally (38). 

Let us now consider the term /3(¢) in (37). Since the quadratic variation [7] exists, 
it follows from Portmanteau’s theorem and hypothesis (i) that 
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(n(x) — n(x — £)? m 
€ 


e>0t 


Ie) = D p? Piet 2, (n) > f Dy P2 (ndina). 
[-T,0] [-T,0] 


Regarding the term /4(e) in (37), let $}: [0, 1]? > L?([—T, 01°) be given by 


Prle, AC.) = DF u(n 4- AC — e) 2 nM rop 


By hypothesis, $; is a continuous map, and hence it is uniformly continuous, since 
[0, 1]? is a compact set. Let Poy denote the continuity modulus of $;, then 


|D u(n + XC = £) 7 nODli rot) -PPY Gs rog; 
= |ne, à) — 6,0, A) ll r2 ,0p X Pon (e). 


This implies, by Cauchy-Schwarz inequality, 


, 2,L? 
f (1— »( f 5 (Dey u(n + A(n(- — 6) — nCDl(— T, op 
0 [-T.0? 


(n(x) — n(x — 90») — ny — €)) 
€ 


2 
«Dn wo) dxdy) | 


! 2.L2 
< n 1 - 3| D2? ut AC — e) — OX rop 


2 = = 2 — — 2 
a) rp, / [a a) - ne - eA) - = DP s 


1 Labo 
< f a = ne f eo e 9s ge. 
0 [-T,0] E 


ol (0)-59€-59Y , sot 
= TAGI f E dx > 0. 


Finally, we consider the term /5(&) in (37). Define y; : [0, 1]? — L**([—T,0]) as 
follows: l 
WE DO = DLP, (n KC — e) — nO) 1-r,01)- 


We see that Yy is uniformly continuous. Let py, denote the continuity modulus of 
V^, then 


| D?- P48 y(n ANG m £) m nC) ror) m D>- Dias Y (n) | resto 
= |vs(e. X) — WO, X)llpeq-r.op < Pw,(). 
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Therefore, we have 


1 ! 
| i a= »( / ITO + GIC — €) 2 nC) D 
0 [-T,0] 


! Loads 
= pz PEU (n) (n(x) ue £)) ax) aa 


1 _ hs? 
J qd -»( | ie n(x — &)) as) dn 
0 [-T.0] E 


_ oy 
= j^ | (n(x) — n(x — €)) dis eot 0 
2 [-T,0] E 


In conclusion, we have proved that all the integral terms in the right-hand side of 
(37), unless J; (£), admit a limit when € goes to zero. Since the left-hand side admits 
a limit, namely D" Y (n), we deduce that the backward integral 


|^ 


n(x) — n(x — £) = / 
£ 


h(e) = J DEU (n) 
[-T,0] [-T, 


" Di. (n)d* n(x) 


exists and it is finite, which concludes the proof. 


3 Strong-Viscosity Solutions to Path-Dependent PDEs 


In the present section we study the semilinear parabolic path-dependent equation 


9,27 -- D 4p - TDVV4y +F, n, UY, DV) = 0, Y(t, n) € [0, T[xC((-T, 0], 
UT, n) = H(n), Yn € C(-T, 0p. 
(43) 
We refer to ZY = 8,27 + DP 4v + ID'V4y as the path-dependent heat oper- 
ator. The results of this section are generalized in [9, 10], where more general 
path-dependent equations will be considered. Here we shall impose the following 
assumptions on H and F. 


(A) H: C([-T,0]) ^ Rand F: [0, T] x C([-7,0]) x R x R > R are Borel 
measurable functions and satisfy, for some positive constants C and m, 


|F(t, 1, Y, z)— F(t, n, y, z’)| s C» — yI + Iz — z'), 
|H(n)| + |F(,n, 0, 0)| < C(1- Ims). 


for all (t, n) € [0, T] x C([—-T, 0]), y, y € R, and z, z € R. 
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3.1 Strict Solutions 


In the present subsection, we provide the definition of strict solution to Eq. (43) and 
we state an existence and uniqueness result. 


Definition 19 A function 7 : [0, T] x C({[—T, 0]) > R in C'2(([0, T[xpast) x 
present) N C([0, T] x C([- T, 0])), which solves Eq. (43), is called a strict solution 
to (43). 


We now introduce some additional notations. Let (2, F , P) be a complete prob- 
ability space on which a real Brownian motion W = (W,;);>0 is defined. Let 
F = (¥;)+>0 denote the completion of the natural filtration generated by W. 


e SP(r, T, p = 1,0 < t < T, the set of real càdlàg F-adapted processes Y = 
(Ys)r<s<r such that 


IY IZ 


SP(t,T) ` 


= ;| sup II^] « oo. 


t<s<T 


e H? (t, TH, p> 1,0 < t < T, the set of R4-valued predictable processes Z = 
(Zs)r<s<7 such that 


T j 5 
p = F 
IZI, pa = |(/ LA 2 | g oo; 


We simply write H? (t, T) when d = 1. 
e A^? (t, T), 0 < t < T, the set of real nondecreasing predictable processes K = 
(Ks)i<s<r € S?(t, T) with K; = 0, so that 


IKI2, := E[IKzI?]. 


S2(nT) 


e LP(t, T; R”), p > 1,0 < t < T, the set of R"-valued F-predictable processes 
$ = (Os)rzs«r such that 


T 
p im nl 
lel, gam, = l f eas "X 


Definition 20 Let: € [0, T] and 7 € C([—T,0]). Then, we define the stochastic 
flow 


n(x t s-—t), -T<x<t-s, 


Wi") = 
n(0) + Wrs — Wr, t-—s <x <0, 


foranyt € s x T. 
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Theorem 4 Suppose that Assumption (A) holds. Let Y : [0, T] x C({[—T, 0]) > R 
be a strict solution to Eq. (43), satisfying the polynomial growth condition 


IYEMI x C(1-- ml), YE, n) € [0,7] x C(—T, 0). (44) 
for some positive constants C and m. Then, we have 
4 (m = Y", V(t, n) € [0, T] x C(-T, 0), 
where (Ys, Zs")set.r) = (U (s, Ws"), DVU (s, Ws?)l,riG)seti,r € S? 


(t, T) x HP? (t, T) is the solution to the backward stochastic differential equation: 
P-a.s., 


T T 
y" = HCW") + / F(r, We", Yp", Zi")dr — J Z;dW, <8 ST. 
AY AY 
In particular, there exists at most one strict solution to Eq. (43). 
Proof Fix (t, n) € [0, T[x C([—T, 0]) and set, for allt € s < T, 
Y!" = Ws, W^" Z^" = DV4 (s, Wo") 1p ris). 


Then, for any To € [t, T[, applying Itó formula (28) to X (s, we") and using the 
fact that Y solves Eq. (43), we find, P-a.s., 


To To 
you = rite f F(r, wr yr zia | ZUqW, t<s<Tp. (45) 
S 


S 
S 


The claim would follow if we could pass to the limit in (45) as To — T. To do this, we 
notice that it follows from Proposition B.1 in [10] that there exists a positive constant 
c, depending only on T and the constants C and m appearing in the statement of the 
present Theorem 4, such that 


k- 


T 
J |F(r, W^",0,0)?dr, VT € [t TT. 


t 


To 
? n2 Ny2 
gf zt Pds = eris rte 


We recall that, for any q > 1, 


s| sup wii] «oc. (46) 


t<s<T 


Notice that from (44) and (46) we have IY ^" lez m) < oo, so that Y € S?(r, T). 
Then, from the monotone convergence theorem we find 
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T T 
) f IZ;"l^ds < cll¥*" gg r) +E LF (r, Wy", 0, 0) Pdr. 
Therefore, it follows from the polynomial growth condition of F and (46) that 
Z €e H?(t, T). This implies, using the Lipschitz character of F in (y, z), that 
an |F(r, Ww”, ys Ze") Pdr < œ, so that we can pass to the limit in (45) and 
we get the claim. 


We conclude this subsection with an existence result for the path-dependent heat 
equation, namely for Eq. (43) with F = 0, for which we provide an ad hoc proof. 
For more general cases we refer to [9]. 


Theorem 5 Suppose that Assumption (A) holds. Let F = 0 and H be given by, for 
all € C([—T, 0]), (the deterministic integrals are defined according to Definition 
4(i)) 


H(n) = «(f pix 4 T)d (x), .. 2 9N (x + Dano), (47) 
[-T.0] [-T.0] 


where 


e h belongs C? (R") and its second order partial derivatives satisfy a polynomial 
growth condition, 
e g1,-..,9n € C’ (LO, TI). 


Then, there exists a unique strict solution Y to the path-dependent heat Eq. (43), 
which is given by 


Ut, n) = E[H(W;")], v n) € [0, TI x C([- T. OD. 


Proof Let us consider the function X : [0, T] x C([—T, 0]) — R given by, for all 
(t, n) € [0, T] x C([- T, 0)), 


U(t, n) = E[H (W7)] 


T 
- UI p+ Ddo f ei6aW.... | 
[-1,0] t 


Z0 ile 4720), | evo Dd), 
[4,0] [-1,0] 


where 


T T 
V (f, X1, ..., XN) mE don f PAW an f exam) |; 
t t 
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for any (t, xi, ..., XN) € [0, T] x R^. Notice that, for any i, j zx. 


T T 
Dy; V (t, x1,..., XN) = e| Ds h(x «f e1GdW.,... xw + f evcoaw.,) |; 
t t 


T T 
D W(t, x1, es tN) = CES e1GdW.,... xw + f evcoaw. ) |; 
: : t t 


so that V and its first and second spatial derivatives are continuous on [0, T] x RN. 
Let us focus on the time derivative 9; V of W. We have, for any 6 > 0 such that 
t +ô € [0, T], 


V (f 3-6, x1, ..., XN) — W(t, xp, ..., XN) 
ó 


1 T T 
= CENA eia...) - nn f nc...... 
ô t+8 t 


Then, using a standard Taylor formula, we find 


V (t --0,xi, ..., XN) — W(t, xi, ..., XN) 
ô 


1 DN T t+6 t+6 
= -52l f a «f vx(syaw, =a | ncm, ..) f exniuda |, 
0 i=l t t t 


Now, it follows from the integration by parts formula of Malliavin calculus, see, e.g., 
formula (1.42) in [31] (taking into account that Itó integrals are Skorohod integrals), 
that, for any i = 1,..., N, 


(48) 


T 14-8 
& Do h(x «f giG)(1 -eses)dW,.. f «iex (49) 


N T t+6 
=(1- e| D? h(x «f gi(s)(1 — alte ies] ())d Ws, «Hu Qi (o,()as], 
j=l * f 


Then, plugging (49) into (48) and letting 5 > 0*, we get (recalling that D? xj has 
polynomial growth, for any i, j) 


N T 
1 
QI x1, ..., XN) = By | D h(t f gi(s)dW,, B 
i,j=l t 
(50) 
for any (t, x1, ..., xw) € [0, T[xR, where at W denotes the right-time derivative 


of W. Since V and or V are continuous, we deduce that 9; V exists and is continuous 
on [0, T[ (see for example Corollary 1.2, Chap.2, in [32]). Moreover, from the 
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representation formula (50) we see that 8; V exists and is continuous up to time T. 
Furthermore, from the expression of n W, we see that 


Qj VU (t, X1, ..., XN) = TR DHT Xp.9X XN). 
2 1 
Therefore, v € C'?([0, T] x RY) and is a classical solution to the Cauchy problem 
awa, + 5 DP i ei(e;(D2, W(x) 20, — V(rx) € [0, T[XRP, 
W(T, x) = h(x), vx c RN. 
(51) 


Now we express the derivatives of 7/ in terms of V. We begin noting that, taking 
into account Proposition 4, we have 


0 
j " gi(x+t)d n(x) = T ngi Œœ t t)dx, | Vn € C([-T, OD. 
ng -t 


This in turn implies that % is continuous with respect to the topology of @([—T, 0]). 
Therefore, % admits a unique extension u: @([—T, 0]) — R, which is given by 


i6) m u(r f pile + dna) I ev + DIT); 
[75,0] [74,0] 


forall (t, n) € [0, T] x € ([—T, 0]). We also define the map: [0, T7] x € ([- T, 0D x 
R — Ras in (13): 


0 
ü(t,y,a) = ult, ylp.ropd aly) = ee y Gi Gr nd, .... 


-—E 


for all (t, y, a) € [0, T] x P. OD x R. Let us evaluate the time derivative 
9,77 (t, n), for a given (t, n) = [0 , xC(- T, 0]: 


dU (t, n) = aos f me evox +)d-n(0)) 
[-1.0] [-1.0] 


N 
S f TETES O G getn). 
Fam [74,0] [74,0] 


Notice that 


0 
a( | " gi (x + nano) = ð (owo -J n(x)ġi (x + nax) 
= T 


0 
n(O)P(t) — n(—1)gi O07) — J n(x)gi(x + t)dx. 
—t 
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Let us proceed with the horizontal derivative. We have 


D UG, n) = Dult, n) = D' üt, m rop n0) 
z u(t, n-r,orC). n(0)) — u(t, n-ro — £), (0) 
fov € 


0 
= dim (v a nOi + di...) 
e> t 


1 0 
— Pid (. .... NOg; (t) -J n(x — £)ġi (x + t)dx,.. ) J 
—t 


From the fundamental theorem of calculus, we obtain 


1 0 
Pul a -f n(x )pi(x +0dx,...) 
=f 


1 0 
- Ps C e nQO)oi() -f n(x — egi (x  D)dx, .. ) 
—t 


e N ð 
1 
= :] 3 Pa (roo - | nts -DAE nds...) (noe 
i=l - 
0 
-J n(x — $i + ndx ay. 
-t 


Notice that 


0 = 
(no (t) — I n(x — ydi x + nax) = -(f nGo)Qi(x + y + nax) 
- - 


t-y 


y 
= (n VAA) - qct vaot | 


-t—y 


napia + y + ous). 
Therefore 


DEY (t, n) 


1 e N 0 
= — lim JI 3 Pat (1 - neo J n — y)Pi@ 4 pds...) (nwa 
e>0t € Jo ia] =} 


— n(-t — y)ġi (0*) +/ JO nOGogdrx ty + nds dy 
tes 


0 
= - Xo: ee T oii nds... (1000 = noh 0 


0 
-f 00e nas). 
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Finally, concerning the vertical derivative we have 


DYU (t, n) = Du(t, m) = dait(t, np. op + n(0)1gop) 
N 
= Zoeff TETTO Jen 
ici [1.0] 


and 


DVY4r(r, n) = DYVu(t, n) = 02, H(t, nli-r,ot + n()19) 


a 
N 


=> z] 6 HDA 0)... Joes. 
i —t, 


From the regularity of W it follows that Y € C L2(([0, T] x past) x present). 
Moreover, since W satisfies the Cauchy problem (51), we conclude that 8; ZZ (t, n) + 
DEU (t, 9) + 3 DV W(t, n) = 0, for all (t, n) € [0, Tx C([—T, 0]), therefore Y 
is a classical solution to the path-dependent heat Eq. (43). 


3.2 Towards a Weaker Notion of Solution: A Significant 
Hedging Example 


In the present subsection, we consider Eq.(43) in the case F = O. This situa- 
tion is particularly interesting, since it arises, for example, in hedging problems of 
path-dependent contingent claims. More precisely, consider a real continuous finite 
quadratic variation process X on (2, F, P) and denote X the window process asso- 
ciated to X. Let us assume that [X]; = t, for any t € [0, T]. The hedging problem 
that we have in mind is the following: given a contingent claim's payoff H (Xr), is 
it possible to have 


T 
Wn ud | AEA (52) 
0 


for some Ho € R and some F-adapted process Z = (Zi);e[o,7j such that Z; = 

v(t, X+), with v: [0, T] x C([-7,0]) —^ R? When X is a Brownian motion W and 

Ji |Z;|?dt < oo, P-as., the previous forward integral is an Itó integral. If H is 

regular enough and it is cylindrical in the sense of (47), we know from Theorem 5 

that there exists a unique classical solution Y : [0, 7] x C([—T, 0]) > R to Eq. (43). 
Then, we see from Itó's formula (28) that Y satisfies, P-a.s., 


t 
UX) = UO, xo) + f DYU, X) d Xs, O<1<T. (53) 
0 


In particular, (52) holds with Z; = DY X (t, X;), for any t € [0, T], Ho = X (0, X;). 
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However, a significant hedging example is the lookback-type payoff 


Hn) = sup y(x)  VmneCc([-T,O0]. 
xe[-T,0] 


We look again for Y : [0, T] x C([-T,0]) — R which verifies (53), at least for X 
being a Brownian motion W. Since 77 (t, W;) has to be a martingale, a candidate for 
U is Y(t, n) = [LH (WI, for all (t, n) € [0, T] x C([—T, 0]). However, this 
latter Y can be shown not to be regular enough in order to be a classical solution 
to Eq. (43), even if it is “virtually” a solution to the path-dependent semilinear Kol- 
mogorov equation (43). This will lead us to introduce a weaker notion of solution 
to Eq. (43). To characterize the map WY, we notice that it admits the probabilistic 
representation formula, for all (t, n) € [0, T] x C([—T, 0], 


v.» = E[HOW75] =E] s wœ] 


-T<x<0 


= E|( sup ney) v ( sup (w - w +00)))] = (t. sup 162.00), 


—t<x<0 t<x<T = 


where the function f: [O, T] x R x R > R is given by 


f(t,m,x) = E[mv (Sr++x)], VG,m,x)€[0,T]xRxR, (54) 


with S, = supg<,<; Ws, for all t € [0, T]. Recalling Remark 3, it follows from the 
presence of SUD_;<1<0 n(x) among the arguments of f, that Y is not continuous 
with respect to the topology of € ([—T, 0]), therefore it can not be a classical solution 
to Eq. (43). However, we notice that sup_,—, <9 (x) is Lipschitz on (C([—T, 0]), ||: 
llo), therefore it will follow from Theorem 7 that X is a strong-viscosity solution 
to Eq. (43) in the sense of Definition 21. Nevertheless, in this particular case, even 
if Y is not a classical solution, we shall prove that it is associated to the classical 
solution of a certain finite dimensional PDE. To this end, we begin computing an 
explicit form for f, for which it is useful to recall the following standard result. 


Lemma 1 (Reflection principle) For every a > O0 and t > 0, 
P(S, =a) = P(|Bi| > a). 


In particular, for each t, the random variables S, and |B,| have the same law, whose 


density is given by: 
2 _2 
p(z) = uu Tlo), WzZER. 
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Proof See Proposition 3.7, Chapter HI, in [35]. 


From Lemma 1 it follows that, for all (t, m, x) € [0, T[x R x R, 


oo 


ft, m, x) zi m v (z + x) er, (3)dz zx m v (z 4 x) 
0 0 


where q(z) = exp(z*/2)/./27, z € R, is the standard Gaussian density. 


Lemma 2 The function f defined in (54) is given by, for all (t, m, x) € [0, T[x R x 
R, 


= 1 = 2(T — t) _(m-x)? 
remm 2n(o( 222) - P) (i o ( 222) + De we, 


for x < m, and 
2(T — t) 
f(t,x,m) = x+ =F. 728 
T 


for x > m, where ®(y) = in y(z)dz, y € R, is the standard Gaussian cumulative 
distribution function. 


Proof First case: x < m. We have 


m-—x 2 z oo 2 Z 
‚m, x) = d UE 
ION. / m T =h — z) aj ed — ol —t 


Jaz. 65) 
The first integral on the right-hand side of (55) becomes 


me 2 Z ) p m-—x 1 
m dz = 2m z)dz = 2m(e( )-;) 
[i ru 0 9t T—t/? 2 


where e (y) = x qQ(z)dz, y € R, is the standard Gaussian cumulative distribution 
function. Concerning the second integral in (55), we have 


zx dz — WT 21 Zz ade+ 2x f z)dz 
11 Tu) T dcs i m—x ve) < 


J/T—t 


= [Eet eai a( 07) 
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Second case: x > m. We have 


fem | eto) 
‚m, x) = : zx rid T4 


oT = | zodz + 2x | o(z)\dz = (ste 
0 0 


We also have the following regularity result regarding the function f. 


Lemma 3 The function f defined in (54) is continuous on [0, T] x R x R, moreover 
it is once (resp. twice) continuously differentiable in (t, m) (resp. in x) on [0, T[xQ, 
where Q is the closure of the set Q := ((m,x) € R x R: m > x}. In addition, the 
following Itó formula holds: 


15 sS. B,))ds (56) 


t 
f(t, Sı, B) = f(0,0, o» f (8.fG. Ss, Bs) + A 


t 


t 
+f ðm f (s, Ss, B,)d S, i ax f (s, Ss, B,)d Bs, 0xt-T, P-as. 
0 0 


Proof The regularity properties of f are deduced from its explicit form derived in 
Lemma 2, after straightforward calculations. Concerning Itó's formula (56), the proof 
can be done along the same lines as the standard Itó formula. We simply notice that, 
in the present case, only the restriction of f to Q is smooth. However, the process 
((S;, B;)); is Q-valued. It is well-known that if Q would be an open set, then Itó's 
formula would hold. In our case, Q is the closure of its interior Q. This latter property 
is enough for the validity of Itó's formula. In particular, the basic tools for the proof 
of Itó's formula are the following Taylor expansions for the function f: 


film, x) = fü mx) £8 fü mx) — t) 
+ [rece moe - n. 
por ee ee ee 
+f Əm f(t, m +A — m), x)(m' — m)dà, 
fam x)= f(,m,x) + à f, m DA + SE mA — x? 


1 
«f (1 — A) (82, f (t, m, x + A(x! — x)) — 92, f (t, m, x)) (x! — x)?da, 
0 


for all (t, m, x) € [0, T] x Q. To prove the above Taylor formulae, note that they 
hold on the open set Q, using the regularity of f. Then, we can extend them to the 
closure of Q, since f and its derivatives are continuous on Q. Consequently, Itó's 
formula can be proved in the usual way. 
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Even though, as already observed, Y does not belong to C 12(([0, T[xpast) x 
present) NC ([0, T] x C({—T, 0])), so that it can not be a classical solution to Eq. (43), 
the function f is a solution to a certain Cauchy problem, as stated in the following 
proposition. 


Proposition 10 The function f defined in (54) solves the backward heat equation: 


0; f (t, m, x) + 182 f(t, m,x) — 0, V (t, m, x) € [0, T[xQ, 
f(T,m,x) = m, V (m, x) cQ. 


Proof We provide two distinct proofs. 
Direct proof. Since we know the explicit expression of f, we can derive the form 
of ð; f and 92, f by direct calculations: 


afma) = ! o(——=), a ft a) = : d 


for all (t, m, x) € [0, T[x Q, from which the claim follows. 
Probabilistic proof. By definition, the process (f (t, S+, B:))re(o,r] is given by: 


ft, Sr, Br) = E[Sr| F], 


so that it is a uniformly integrable F-martingale. Then, it follows from It6’s formula 
(56) that 


t 1 t 
f (ase, Ss, Bs) + 59 f Gs, Ss, B,))ds «f ðm f (s, Ss, B;)d S, = 0, 
0 0 


for all 0 < t < T, P-almost surely. As a consequence, the claim follows if we prove 
that 


t 
f Om f (s, Ss, Bs)dSs = 0. (57) 
0 


By direct calculation, we have 


Qn f(t,m,x) = 20( ifi )- L — Y(t,m, x) € [0, T[xQ. 


Therefore, (57) becomes 


ri (20(<=*) -1)as = 0 (58) 
0 JT =s bs 
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Now we observe that the local time of S, — B, is equal to 25;, see Exercise 2.14 in [35]. 
It follows that the measure dS, is carried by (s: S, — B, = 0}. This in turn implies 
the validity of (58), since the integrand in (58) is zero on the set (s: S, — B, = O}. 


3.3 Strong-Viscosity Solutions 


Motivated by previous subsection and following [10], we now introduce a concept of 
weak (viscosity type) solution for the path-dependent Eq. (43), which we call strong- 
viscosity solution to distinguish it from the classical notion of viscosity solution. 


Definition 21 A function Z : [0, T] x C([-T,0]) > R is called strong-viscosity 
solution to Eq. (43) if there exists a sequence (%,, Hn, Fn), of Borel measurable 
functions Y%,: [0, T] x C([-7T,0) — R, H,: C(-7,0]) —> R, F,:[0, T] x 
C([-T, 0p x R x R — R, satisfying the following. 


(i) Forallt € [0, T], the functions 7, (t, -), H4), Fn(t, +, +, +) are equicontinuous 
on compact sets and, for some positive constants C and m, 


IFa(t, n, y, z) — Fa(tm yz) x Cy — yl lz- z p. 
127, t, MI + |Hn(n)| + |Fr(t,n, 0,0)| < C(1+ Ini), 


for all (t, n) € [0, T] x C({-T, 0]), y, y € R, and z, z € R. 
(ii) 4, is a strict solution to 


9,25, + DEY + TDYV + Falt, N, Ya, DVM) =0, Y(t, n) € [0, TIxC([— T, Op, 
A. (T, n) = An(n), Vn € C([-T,0]. 


(iii) (a, Hn, F4) converges pointwise to (77, H, F) as n tends to infinity. 


Remark 12 (i) Notice that in [8], Definition 3.4, instead of the equicontinuity on com- 
pact sets we supposed the local equicontinuity, i.e., the equicontinuity on bounded 
sets (see Definition 3.3 in [8]). This latter condition is stronger when % (as well as 
the other coefficients) is defined on a non-locally compact topological space, as for 
example [0, T] x C([—T, 0]. 

(ii) We observe that, for every t € [0, T], the equicontinuity on compact sets of 
(M(t, -))n together with its pointwise convergence to X (t, -) is equivalent to requir- 
ing the uniform convergence on compact sets of (7, (t, -)), to W(t, -). The same 
remark applies to (Hn (-)), and (Fn (t, <+, -))5. t € [0, T]. 


The following uniqueness result for strong-viscosity solution holds. 
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Theorem 6 Suppose that Assumption (A) holds. Let Y : [0, T] x C([-T, 0]) > R 
be a strong-viscosity solution to Eq. (43). Then, we have 


4 (9 = Y^, Y(t, n) € [0, T] x C(-T,0), 


where YE”, 2 y ean e S? (t, T) x H(t, T), with y" = Y s, we”), solves the 
backward stochastic differential equation, P-a.s., 


T T 
y^" = now's) f FG, WE", Y^, inar — [ ZPdW, t<s<T. 
s AY 


In particular, there exists at most one strong-viscosity solution to Eq. (43). 


Proof Consider a sequence (77 , Hn, Fn)n satisfying conditions (i)-(iii) of Definition 
21. For every n € N and any (t, n) € [0, T] x C([—T,0], we know from Theorem 
4 that (Y? ^", Ze ser] = (Uls, Ws"), DY 24, (s, Ws"))sett.7] € S? (t, T) x 
HP (t, T) is the solution to the backward stochastic differential equation, P-a.s., 


T T 
y e mw [ F.(r, Wi" ren, zar [| ZUUgW. o 22927. 
s s 


Our aim is to pass to the limit in the above equation as n — oo, using Theorem C.1 
in [10]. From the polynomial growth condition of (77), and estimate (46), we see 
that 


sup|Y'"^"|s» r, < oo,  forany p> 1. 
n 


This implies, using standard estimates for backward stochastic differential equations 
(see, e.g., Proposition B.1 in [10]) and the polynomial growth condition of (Fy)n, 
that 


F 
sup | Z> lma,r) < oo. 
n 


Let Y, ic " = Y (s, we”), for any s € [t, T]. Then, we see that all the requirements of 
Theorem C.1 in [10] follow by assumptions and estimate (46), so the claim follows. 


We now prove an existence result for strong-viscosity solutions to the path- 
dependent heat equation, namely to Eq. (43) in the case F = 0. To this end, we 
need the following stability result for strong-viscosity solutions. 


Lemma 4 Let (Mx, Hi k, Fn k)n jo (Ys Hn, Fn)n, (Y, H, F) be Borel measur- 
able functions such that the properties below hold. 


(i) Forallt € [0, T], the functions 4Z, k(t, ), Ha C), and Fa (t, n,k EN, 
are equicontinuous on compact sets and, for some positive constants C and m, 


Functional and Banach Space Stochastic Calculi ... 75 


[Fo kt n, y, z) — Fa k(t, m yz) x Cy — yl lz—z p. 
Wet, MI + LH k| + LES. 0, 0,0)! < C(1- linli), 


for all (t, n) € [0, T] x C(—7T,0], y, y € R, and z, z € R. 
(ti) ak is a strict solution to 


0, 2k + DE Unk + IDVV4,, 
+ Fi k(t, n, Unk, DV Vak) = 0, V (t, n) € [0, T[x C([- T, 0p. 
AQ, n) = Ane), Yn € C(-T, 0p. 


(iii) (Vak, Ha, Fn, k) converges pointwise to (AZ, Hn, Fn) as k tends to infinity. 
(iv) (QM, Hn, Fn) converges pointwise to (Y , H, F) as n tends to infinity. 


Then, there exists a subsequence (Wn x, , Ha k,, Fnk,)n Which converges pointwise 
to (V , H, F) as n tends to infinity. In particular, Y is a strong-viscosity solution to 
Eq. (43). 


Proof See Lemma 3.4 in [8] or Lemma 3.1 in [10]. We remark that in [8] a slightly 
different definition of strong-viscosity solution was used, see Remark 12(1); however, 
proceeding along the same lines we can prove the present result. 


Theorem 7 Suppose that Assumption (A) holds. Let F = 0 and H be continuous. 
Then, there exists a unique strong-viscosity solution Y to the path-dependent heat 
Eq. (43), which is given by 


U(t, n) = E[HOW;")], V(t.) € [0, T] x C(I- T. 0. 


Proof Let (e;);>0 be the orthonormal basis of L? ([-T, 0]) composed by the func- 
tions 


eo = L eoi —1(x) = [sin (Fo+n), eoi (x) = [2 00s (Fo+n), 
JT T T T T 


for alli € NX (0). Let us define the linear operator A: C([—T, 0) — C([—T,0] by 


UC 


(An)(x) = T 


; x € [-T, 0], n € C([- T, 0]. 

Notice that (n — Ag) (—T) = (n — An)(0), therefore  — An can be extended to the 
entire real line in a periodic way with period T, so that we can expand it in Fourier 
series. In particular, for each n € N and y € C([—T, 0]), consider the Fourier partial 
sum 


Sn(n — An) = Vai -(Am)e, Yn € C([-T, 0)), (59) 
i=0 
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where (denoting e; (x) = des ei (y)d y, for any x € [—T, 0]), by Proposition 4, 


0 
y = / n(x)ej(x)dx = vox. - f ei(x)d n(x) 
T [-T.0] 


z | (66) eta FG), (60) 
[- 7,0] 


since 7(0) = Jiro d n(x). Moreover we have 


0 1 0 
(Api = J AMW ead = 2 J sacoda( | 4796) - (1). 
—T -T [-T,0] 
(61) 


Define 


so d $1 oct + Sn 
nal 


On = 


Then, by (59), 


n 


- 
e An) = ZH E -ADe Yn e CUT. OD). 
i=0 


We know from Fejér’s theorem on Fourier series (see, e.g., Theorem 3.4, Chapter 
MI, in [44]) that, for any n € C([—T,0]), o4(n — An) — n — An uniformly on 
[—T, 0], as n tends to infinity, and ||o,(7 — An)|loo x |In — Anllo. Let us define the 
linear operator Ta: C([—T, 0]) — C([—7,0] by (denoting e-1(x) = x, for any 
x e[-T,0p 


n 


n+l—i n(0) — n(-T) 
>, tt — (An)i)ei + EE END 


Tan = On(n — An) + An 


n+1-i 
= > ———— xiei + x_1e-1, (62) 
c 8 +1 


where, using (60) and (61), 


1 1 
w= d(x) n(-T), 
bow T T 


0 0 
x= I (a — @ (x) — =J eode ano + =J xei(x)dx n(—T), 
[-T.0] T jot T jor 
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fori = 0,...,n. Then, for any n € C([—T, 0]), Tan — n uniformly on [—T, 0], as 
n tends to infinity. Furthermore, there exists a positive constant M such that 


lTnnlloo < Mlnilo, | Vn €N, Yn € C([-T, 0p. (63) 


In particular, the family of linear operators (7;), is equicontinuous. Now, let us 
define H,: C([-T,0]) — R as follows 


H,() = H(Tan), | Vn e C(-T, 0]. 


We see from (63) that the family (An) n is equicontinuous on compact sets. Moreover, 
from the polynomial growth condition of H and (63) we have 


a € CU+(Tinll) < CA +M”) Yn €N, Yn e C(-T, 0). 


Now, we observe that since {e—1, €0,€1,..., €n} are linearly independent, then 
we see from (62) that Tan is completely characterized by the coefficients of 
€_1, 69, €1, .. . , €n. Therefore, the function h, : R"*? — R given by 


n 


ha(x-1 x9) = Hp = (> 


i=0 


n+1-i 


"epp A tries), V (x1... 24) € R2, 


completely characterizes H,,. Moreover, fix n € C([—T, 0]) and consider the cor- 
responding coefficients x—1, ..., Xn with respect to {e_1,..., €n} in the expression 
(62) of Tan. Set 


1 . . 1 f° 

9e-1()— =,  di(x) = ei(0) — eix — T) — =J xei(x)dx, | x €[0,T], 
T T J.T 

Z I / xe (0d 
4j = T q = T URGE x. 

Notice that Q1, ..., 9, € C% ([0, T]). Then, we have 

Hy) -i(f pice + D4 Gai D)... [ eG Dd ns) aui). 
J [-T,0] [-T,0] 


Let (x) = cexp(1/G? — T?))ljor((x), x > 0, with c > 0 such that fo^ $ (x) 
dx = 1. Define, for any E€ > 0, $«(x) = b(x/e)/e, x > 0. Notice that d, € 
C*? ([0, oo[) and (denoting $« (x) = i $« Cy)dy, for any x > 0), 


0 
E n(x)be(x + T)dx = nO)ġ:(T) -f 


be(x + T)d~ n(x) 
[-T.0] 


-=f ($e (T) — be(x + T))d~ n). 
[-T,0] 
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Therefore 
5 - 0 
lim (G(T) — be(x +T))d~ n(x) = lim f nx)ps(x +T)dx = n(-T). 
e>0* J[-T,0] £0t* J—T 


For this reason, we introduce the function H, : C([—T, 0]) — R given by 
Hy(n) = Zi end, mee nacionem f D oe D)47 noD. | 
[-T,0] [- T.0] 


Now, for any n € N, let (hn,k)ķen be a locally equicontinuous sequence of 


C? (R"??: R) functions, uniformly polynomially bounded, such that /1, , converges 
pointwise to /1,, as k tends to infinity. Define H, 4: C([—T,0]) — R as follows: 


Ha = (sf ei Td noa | 
[- 7,0] 


[-T.0] 


($n (T) — bn(x +T))d~ n(x), .. 3 


Then, we know from Theorem 5 that the function 77, 4: [0, T] x C([—T, 0]) —^ R 
given by 


Un at, n) = E[H AOV], Y, n) € (0, T] x C[—T. OD 


is a classical solution to the path-dependent heat Eq. (43). Moreover, the family 
(UYa,.k)n,e,k is equicontinuous on compact sets and uniformly polynomially bounded. 
Then, using the stability result Lemma 4, it follows that Y is a strong-viscosity 
solution to the path-dependent heat Eq. (43). 
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Nonlinear Young Integrals via Fractional 
Calculus 


Yaozhong Hu and Khoa N. Lé 


Abstract For Hólder continuous functions W(t, x) and g, we define nonlinear 
integral f bg W (dt, p+) via fractional calculus. This nonlinear integral arises naturally 
in the Feynman-Kac formula for stochastic heat equations with random coefficients 
(Hu and Lé, Nonlinear Young integrals and differential systems in Hólder media. 
Trans. Am. Math. Soc. (in press)). We also define iterated nonlinear integrals. 


Keywords Nonlinear integration * Young integral * Iterated nonlinear Young 
integrals 


1 Introduction 


Let {gy , t > 0} be a Hólder continuous function and let {W (t, x),t > 0,x € R4] 
be another jointly Hólder continuous function of several variables (see (10) for the 
precise statement about the assumption on W). The aim of this paper is to define the 
nonlinear Young integral f E W (dt, p1) by using fractional calculus. 

This paper can be considered as supplementary to authors’ recent paper [5], where 
the nonlinear Young integral is introduced to establish the Feynman-Kac formula for 
general stochastic partial differential equations with random coefficients, namely, 


Qu (t, x) + Lu(t, x) + u(t, x)9,W (t, x) =0, (1) 
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where W is a Hólder continuous function of several variables (which can be a sample 
path of a Gaussian random field) and 


d d 
1 
Lut, x) = 5 S aij (t,x, W)82 u(t, x) + >) bilt, x, W)ds, u(t, x) 


J 
i,j=l i=l 


with the coefficients aj; and b; depending on W. The terminal condition for the 
Eq. (1) is u(T, x) = ur (x) for some given function u 7 (x). 

To motivate our study of the nonlinear Young integral, let us recall a basic result 
in [5] on the Feynman-Kac formula: Let o(t, x) = (oij(t, x, W))i<i,j<a satisfy 
a(t, x, W) = o (t, x, W)o (t, x, W) (we omit the explicit dependence of o on W). 
Consider the following stochastic differential equation 


ax) S0@,2," 08 +b(t, dt, O<r<t<T, X =x,  Q 


where (B;,0 < t < T) is a standard Brownian motion and êB, denotes the 
Itó differential. Then it is proved in [5] that under some conditions b and o and 
W (which are verified for certain Gaussian random field W), the nonlinear inte- 
gral f T W(ds, X"*) is well-defined and exponentially integrable and u(r, x) — 
zB [ur (X5 )exp [4 W (ds, x? | is a Feynman-Kac solution to (1) with u(T, x) 
= ur (x). One of the main tasks in that paper is the study of the nonlinear Young 


integral Ld W (ds, X,^). To this end we used the Riemann sum approximation and 
the sewing lemma of [2]. In this paper, we shall study the nonlinear Young inte- 
gral f E W (dt, p1) by means of fractional calculus. This approach may provide more 
detailed properties of the solutions to the equations (see [6, 7]). 

Under certain conditions, we shall prove that the two nonlinear Young integrals, 
defined by Riemann sums (through sewing lemma) or by fractional calculus, are the 
same (see Proposition 2). 

To expand the solution of a (nonlinear) differential equation with explicit remain- 
der term we need to define (iterated) multiple integrals (see [3]). We shall also give a 
definition of the iterated nonlinear Young integrals. Some elementary estimates are 
also obtained. 

The paper is organized as follows. Section2 briefly recalls some preliminary 
material on fractional calculus that are needed later. Section 3 deals with the nonlinear 
Young integrals and Sect. 4 is concerned with iterated nonlinear Young integrals. 


2 Fractional Integrals and Derivatives 


In this section we recall some results from fractional calculus. 
Let —oo < a < b < co, a > Qand p > 1 be real numbers. Denote by L? (a, b) 
the space of all measurable functions on (a, b) such that 
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b 
HN ] ova m 


Denote by C ([a, b]) the space of continuous functions on [a, b]. Let f € E [a, b]). 
The left-sided fractional Riemann-Liouville integral 1%, f is defined as 


if ()=—— | (t— 5*7 f(s)ds, t€ (a,b) (3) 


I'(a) 


and the right-sided fractional Riemann-Liouville integral 7%_ f is defined as 


I5- jüst 


a fe -A| f (s)ds, te (a,b) (4) 


where (—1)7® = e77* and F (a) = h r*—e7' dr is the Euler gamma function. 
D. : aa (LP) (resp. I?" (L")) be the image of L? (a, b) by the operator 1%, (resp. 

). If f € 1% (LP) (resp. f € Ip. (L?)) and 0 < a < 1, then the (left-sided or 
EUN sided) Weyl derivatives are defined (respectively) as 


— 2 fO | [f0-f0, 
PejO-ra-3(u-a 7] Goat P" 
and 
sun € [ fo Lf fo-fo, 
D,- f (t) = rd—-a | b-p ta ? Len 4 , (6) 
f 


wherea < t < b (the convergence ofthe integrals at the singularity s = t holds point- 
wise for almost all t € (a, b) if p = 1 and moreover in LP -sense if 1 < p < co). 
It is clear that if f is Hólder continuous of order u > o, then the two Weyl 
derivatives exist. 
For any f € (0, 1), we denote by C P([a, b]) the space of 8-Hólder continuous 
functions on the interval [a, b]. We will make use of the notation 


— f(8 
lfllg;a.p = sup IFE) = fO 


a<0<r<b Ir — 6) 


(which is a seminorm) and 


IIflloo;a,b = sup |f), 


a<r<b 
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where f : IR — R is a given continuous function. 

It is well-known that C^ ([a, b]) with the Hólder norm || Fllgia.b + Mf lloo:a,b is a 
Banach space. However, it is not separable. 

Using the fractional calculus, we have (see [9] and also [3]) 


Proposition 1 Let 0 <a < 1. If f and g are continuously differentiable functions 
on the interval [a, b], then 


b b 
J fag= (-1)" J (DS, f ©) (Di-^ e- ©) dt, 7) 


where gp- (t) = g (t) — g (b). 


In what follows « denotes a universal generic constant depending only on A, T, œ 
and independent of W, o and a, b. The value of x may vary from occurrence to 
occurrence. 

For two function f, g : [a, b] — R, we can define the Riemann-Stieltjes integral 


f? f(t)dg(t). Here we recall a result which is well-known (see for example [3, 9] 
or [6, 7]). 


Lemma 1 Let f and g be Hólder continuous functions of orders a and B respec- 
tively. Suppose that a + B > 1. Then the Riemann-Stieltjes integral f f (0dg(t) 
exists and for any y € (1 — B, œ), we have 


b b 


J f (t)dg(t) = (-1)” J DY f (t) D) 7 gy (0dt . (8) 


a a 


Moreover, there is a constant k such that 


b 
n < Kllellp:a,o(f llooa,olb — al? + If lla;a blb — atA. O) 


Proof We refer to [9] or [3] for a proof of (8). We shall outline a proof of (9). Let y 
be such that a > y > 1 — P. Applying fractional integration by parts formula (8), 
we obtain 


b b 
J 1s < [oz fOD; goo 


From (5) and (6) it is easy to see that 


1— = 
ID,” gy (£)| < «llglig;an(b — PTY" 
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and 
ID} f (01 < Kif looa, b(t — a) Y + If lla;a b(t — ^? ]. 


Therefore 


J f (t)dg(t)| < Kllgilp:a.p | Ilf loo; ]t-7e- ora 


b 


Il fllesa,b fe — a)" (b — pPtY-gr 


a 


The integrals on the right hand side can be computed by making the substitution 
t — b — (b — a)s. Hence we derive (9). 


We also need the following lemma in the proofs of our main results. 


Lemma 2 Let f(s,t),a < s «t < b be a measurable function of s and t such that 


j | ege dias 


Then 
b b 


| Bir t)|r-;dt = coe f ig! f (t, tP)|iidt. 


a a 
Proof An application of Fubini's theorem yields 


b 


] re vua = ns] ] ads dt 


E ff f(s.) 
“Fel | G94 


e f i f, t )iidt 


a 


which is the lemma. 
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3 Nonlinear Integral 


In this section we shall use fractional calculus to define the (pathwise) nonlinear 
integral i. W (dt, g,). This method only relies on regularity of the sample paths 
of W and g. More precisely, it is applicable to stochastic processes with Hólder 
continuous sample paths. 

Another advantage of this approach is that in the theory of stochastic processes 
it is usually difficult to obtain almost sure type of results. If the sample paths of the 
process is Hólder continuous, then one can apply this approach to each sample path 
and almost surely results are then automatic. 

In what follows, we shall use W to denote a deterministic function W : Rx R? > 
IR¢. We make the following assumption on the regularity of W: 


(W) There are constants z, A € (0, 1] such that for all finite a < b and for all 
compact sets K of Rf, the seminorm 


|W lc xa b. K 
: IW (s, x) - W(t, x) - W(s, y) - WO, y)| 
:= sup = x 
axs«txb |t = s| |x = yl 
x,yEK;x £y 
W , -W t, W t, =W t, 
stip |W(s, x) (t, x)| d. üp IW (t, y) 4 2l. (10) 
a<s<t<b |t — s|* a<t<b lx — y] 
xeK x,yeK;xzy 
is finite. 


About the function o, we assume 
(0$) ois locally Holder continuous of order y € (0, 1]. That is, the seminorm 
lelusc sup POLO 
dod a<s<t<b It = s|” 
is finite for every a < b. 


Among the three terms appearing in (W), we will pay special attention to the first 
term. Thus, we denote 


IW, x) - Wit, x) - Ws, y) + WE YL 


a<s<t<b |t — s|*|x ud y^ 
x,yeK;xxy 


If a, b is clear from the context, we frequently omit the dependence on a, b. In 
addition, throughout the paper, the compact set K can be chosen to be any compact set 
containing the image of ¢ on the interval of integration. Thus we omit the dependence 
on K as well. For instance, || W ||;,;; is an abbreviation for || W ||; 55,5, k . llglly is an 
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abbreviation for || ||y;a,b and so on. We shall assume that a and b are finite. Thus it 
is easy to see that for any c € [a, b] 


sup |g(?)| = sup |g(c) + g(t) — e)I < eol  llellylb — al” « oo. 


a<t<b a<t<b 


Thus assumption (@) also implies that 


IPlloo:a,b := sup |(t)| <œ. 
a<t<b 


Remark 1 Given a stochastic process indexed by (ft, x), it is possible to obtain 
almost sure regularity of the type (10) by a multiparameter Garsia-Rodemich-Rumsey 
inequality. Indeed, this has been explored in [4], see also the last section of [5]. 


One of our main results in this section is to define f? W (dt, q;) under the condition 
Ay +t > 1 through a fractional integration by parts technique. The following 
definition is motivated from Lemma 1. 


Definition 1 We define 


b b 
7 W (dt, pi) = (-D* J D** Dj" Wy (t, gy)lr-idt (11) 
a a 


whenever the right hand side makes sense. 


Remark 2 Assume d = 1. Let W(t, x) = g(t)x be of the product form and let 
g(t) = f(t), where g is a Hólder continuous function of exponent t and f isa 
Holder continuous function of exponent A. If 1 — t < o < A, then 


b b 
fwa gi) = cy f Dey! ni we. t')|yardt 
a 


a 


b 


= cy f pi" e ODS OA. 


a 


Thus from (8), r W (dt, p1) is an extension ofthe classical Young integral I f@dg(t) 

(see [3, 8, 9]). For general d, if W(t,x) = Y4 gi (t)xi and g(t) = 
qa? 

fi(t), then it is easy to see that [^ W(dt, qj) = 5 J fi(t)dgi(t). 


izla 


The following result clarifies the context in which Definition 1 is justified. 
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Theorem 1 Assume the conditions (W) and (@) are satisfied. In addition, we sup- 


pose that Xy +t > 1. Leta € (1 — v, Ar). Then the right hand side of (11) is finite 
and is independent of a € (1 — t, A). As a consequence, we have 


b 
fre Qr) 


= cx i Df DI W, (t, @y)lyardt 


J Wp- (t, pı) 
Tr (1 — a) (b — t)1-*(t — a) 


drdt 


b 
e] F Wo-(t, pi) — Wo-(t, pr) 
(b _ t)! (t — rye 


3- tb ag) E j we. dom - a een dsdi 


ZU W(t, H 9) = WE e) + WG 9») on 


— t) a(t = rye 
(12) 
where Wp_ (t, x) = W (t, x) — W (b, x). Moreover, there is a universal constant K 


depending only on t, X and a, but independent W, o and a, b such that 


f wang < kIWllza a bba)" +4 Whe aa bloa p-a), (13) 


a 


where ||W||z.i:a,p = lIWl|zx:a,p,k and K is the closure of the image of (gi, a < 
t <b). 
Proof We denote || W|| = || W||z.a:a.p. First by the definitions of fractional derivatives 


(5) and (6), we have 


PA 1) ^9 | Wp- (t, oy W(t, -W / 
Dj Wy (t, gr) = Ru C. +a af c - V Pus end 
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and 


n pee Wy (t, pr) 


a 


/ 


1 
(-1)!-# 1 Wy (t, gr) à Wp- (t, pr) m Wp_(t, Qr) 
~ Feorü-e)|«e-—a*b-0-«  J (r—ret(b- nre 


dr 


b 
1—a Wt, qp) — W(s, pr) 


(t —ay* , Go ” 


dsdr 


t 
Ü b 
+a o [ a / W(t, oy) — W(s, oy) — W(t, o) + W(s, qp) 
(t NE peti . (s = nm 
a t 


Thus the right hand side of (11) is 
i b b t 
1 Wp- (t, pi) dés a f Wp- (t, pt) — Wo- (t, Pr) , n 
I'(a)I' (1 — a) (b — t)1-9 (t — a) (b — t)1-e (t — rye 


b b 
L a-w | | Wem- WO. 9) , 
(s — t)? (t — a)* 
a t 


(s — t)? (t — ret 


bo t-b 
L aay f [f W(t, Or) — W(s, pi) — W(t, pr) + W(s, ara 


a a t 


=: J+h+h4+h. (14) 


The condition (W) implies 


b 
Ij € «|W Jo - fte (t — a) “dt 
a 


— k|WI(b— a). (15) 


Similarly, we also have 


b b 
I3 € «|W n «qt 6 — ay “dade 
a t 


< KIWI- a). (16) 
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The assumptions (W) and (9) also imply 


Wr- (t, pi) — Ws (t, 99)] € kIIWIIIb — tl" ler — el^ 


À À 
< kIWillell; lb — t t — rp. 


This implies 


b t 
h< «mier; f fo -Ala o r) ldrdt 


a a 
< |Willel (b — ayt . (17) 
Using 


IW, gi) — WG, p) — W(t, gr) + WG. gl € kIWIlellIt — silt — ri”, 


we can estimate 74 as follows. 


b t 6 
A lE essi p RI 
I < «||WIlllell, p-p eG rn didrdt 
aa t 


<«||Willel;o-a”. (18) 


The inequalities (15)-(18) imply that for any œ € (1 — t, yA), the right hand side of 
(11) is well-defined. The inequalities (15)-(18) also yield (13). 

To show (12) is independent of a we suppose o, a € (1— t, Ay), œ” > a. Denote 
B =a' — a. Using Lemma 2, it is straightforward to see that 


b 
c-v* f noti Wy (t, Gr )|e—dt 
a 
b 


x J I Dey Des Dy" Wo-(t, yid 
a 


b 
= (-1)"*8 7 IPE pott ple Ww, (oy) s dt 


a 
b 


=(-1)” i pos pP! DIO" Ww, (e gy) adt 
a 
b 


= (-1)" J DX! pl-* " W, (t, gy) dt. 


a 
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This proves the theorem. 
Now we can improve the equality (13) as in the following theorem 


Theorem 2 Let the assumptions (W) and (@) be satisfied. Leta, b, c be real numbers 
such that a < c < b. Then there is a constant k depending only on t, X and a, but 
independent W, q and a, b, c such that 


b 
J W (dt, pi) — W (b, pe) + W (a, 9c) | < KIW lleza bllell a p — a) ^" . 
a 
(19) 
Proof Leta < c < d < b and let Q(t) = (c) Xic,a) (t), where xc,4) is the indicator 


function on [c, d). Then 


W(t,qg(c) cxt' «d 
W(t, ot’) = 
Wt, 0) elsewhere . 


This means W (t, Q(t")) = W(t, q(c)) xic,ay (t^). Hence, from (8) we have 


b b 
J W(dt, $(t)) = (-D* J DI Wy (t, o) DE xp ay t )leidt 
a 


a 
b 


= yt f ni" wi. e) DE ea dt 
a 
= W(d, g(c)) — W (c, g(c)). 
Let c be any point in [a, b]. Denote Wit, x) = W(t,x) — W(t, 9e). Then W 
satisfies (W). As in the Eq. (14), we have 


b b 
n W (dt, p1) — W (b, gc) + Wa, Ge) = J W (dt, gi) 
a a 
= Kht+th+h+h. 


where L = h and i = [4 are the same as J and 74 in the proof of Theorem 1. But 
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b 
h- 1 — WO, pi) — W(t, Gc) + WO, Pe) 4 
17 Perd —a) (b—1)'-#(t a) 
z (1— o) j W(t, pt) — W (s, pt) — W (t, Gc) + W(s, Ge) 
h= dsdt . 
rlr (1— a) J (s — t-9 (t — a) 
From the assumptions (W) and (@) we see that 
[W (t, qi) — WO, qi) — W(t, ec) + WO, 9c)| 
< «|| Wlheasabllolls iq ub — tlt — ci 
< kIWllzaaollelih ig plb — tlt — al" . 
This implies that 7 
hi < KIWI lel ap — a) ^" . Q0) 
Similarly, we have 
B < IW allel» 4,0 — a) ^. (21) 


Combining these two inequalities (20) and (21) with the inequalities (17) and (18) 


we have 
b 


f 9er om < k|Wllzaaollells.s sb — a) ^ , 


which yields (19). 
Theorem 3 Let the assumption (W) be satisfied. Let 9 : [a, b] — R? satisfy 


ipa tr=a WredaB a sy Disc E a 


a<t<s<b (s — t) ~ 
(22) 
for some £ € (y, oo) and for some constant L € (0, oo). Ift +Ay > 1, then for any 
B<1+ 1711 (we have 


fwa qi) — W (b, pa) + W(a, 94)| < C(b — ay, (23) 


here the constant C does not depend on b — a. 


Proof Asin the proof of Theorem 2 we express f W (dt, p)—W (b, Ga) 3-W (a, Qa) 
as the sum of the terms 7 jo J = 1,2,3,4 (we follow the notation there). First, 
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we explain how to proceed with 14. We shall use C to denote a generic constant 
independent of b — a. Denote 


J :— Wt, qi) — W(s, e) — W(t, pr) + WCs, Gr) 
First, we know that we have 
J < Clt- s|? jt- r]. (24) 
On the other hand, we also have 


J < |W (t, Qr) Eo W(s, Qr) Y W(t, Qa) + W(s, Qa)| 
+W(t, Qr) m W(s, Qr) m W(t, Pa) s W(s, $a)] 
< Cir = siu - at + r - ar] 


< C|t — s|*|t — ap Q5) 


whena <r <t < s < b. Therefore, from (24) and (25) it follows that for any 
P1 => 0 and £5 > 0 with B, + 85 = 1, we have 


J x C[t — s|* |t — r|Pi^Y |t — Po 
If we choose œ and f, such that 
tta>l, BiAy -—a>0 (26) 


then 
Lh < C(b— qyhiay tet i 


For any 8 < 1+ d we can choose a, 61, and fz such that (26) is satisfied 
and . 
l4 x C(b—ay. 


The term /» can be handled in a similar but easier way and a similar bound can be 
obtained. 
Now, let us consider 74. We have 


IW (t, or) — W(s, pi) — W(t, Qa) + W(s, a)l € Clt —s|*|t — ap^. 


This easily yields : 
hsCh-a. 


A similar estimate holds true for /;. However, it is easy to verify t + A£ > 1 + 
EH if £ > y. The theorem is proved. 
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Next, we show that the nonlinear integral defined in Definition 1 coincides with 
the limit of Riemann sums. For this purpose, we need some preliminary set up. For 
every s, t in [a, b], we put u(s, t) = W (t, ps) — W (s, ps). Let m = {a = tọ «ti < 

: « f, = b) be a partition of [a, b] with mesh size |x| = maxj<j<p |ti — ti—1l. 
One can consider the limit of the Riemann sums 


n 


lim Iti, ti) 
m2. TX 


whenever it exists. A sufficient condition for convergence of the Riemann sums is 
provided by following two results of [2], see also [1] for a simple exposition. 


Lemma 3 (The sewing map) Let u be a continuous function on [0, T]? with values 
in a Banach space B and £ > 0. Suppose that u satisfies 


IA (a, b) — wa, c) — u(c, b)| < Kijb=al™ VO<a<c<b<T. 
Then there exists a function Y u(t) unique up to an additive constant such that 
| £ u(b)— £ ula) ula, b)| < KA-2) b-a YO<a<b<T. (27) 


We adopt the notation Seu = fu(b)— Sula) 
Lemma 4 (Abstract Riemann sum) Let z = {a = tg < tj < --- < tm = b} bean 
arbitrary partition of [a, b] with || = supj-o,.... 4.1 |tit+1 — ti]. Define the Riemann 


sum 
m—1 


Jn = >) wi, tis1) 
i-0 


then Jz, converges to Jeu as |r| 4 0. 


Because t + Ay is strictly greater than 1, the estimate (19) together with the 
previous two Lemmas implies 


Proposition 2 Assume that (W) and ($) hold with Xy +t > 1. As the mesh size 
|x| shrinks to 0, the Riemann sums 


n 


> [W;, Qr 4) = W (tii, 1-1) 


i=l 


converges to f W (dt, gi). 


Remark 3 In [5], the authors define the nonlinear integral Í W (dt, q;) via the sewing 
Lemma 3. The previous proposition shows that the approach using fractional calculus 
employed here produces an equivalent definition. Let us note that this is possible 
because of the key estimate (19) and the uniqueness part of the sewing Lemma 3. 
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It is easy to see from here that 


b c b 
f weno e | wears e f wang Va<c<b. 
a a c 


This together with (13) imply easily the following. 


Proposition 3 Assume that (W) and ($) hold with Xy + v > 1. As a function 
of t, the indefinite integral Hr W(ds, gs), t<a< p} is Hölder continuous of 
exponent T. 


Further properties can be developed. For instance, we study the dependence of 
the nonlinear Young integration f W (ds, ps) with respect to the medium W and the 
integrand o. We state the following two propositions whose proofs are left for readers 
(see, however, [5] for details). 


Proposition 4 Let W and W be functions on R x R satisfying the condition (W). 
Let q be a function in CY (R; R7) and assume that t + Ay > 1. Then 


b b 
| | vias. oo -f W2 (ds, s)| € |W1 (b, pa) — W1 (a, Ga) — W2 (b, Ga) + W2 (a, Ga)| 
a a 


+ c(lella)DWi — Wolg.callellylb — a+” . 
Proposition 5 Let W be a function on R x R4 satisfying the condition (W). Let o! 


and ° be two functions in CY (R; RÌ) and assume that t + Ay > 1. Let 0 € (0, 1) 
such that t + Ory > 1. Then for any u < v 


vU vU 
| f weds. e — | weds, oto 
u u 


< CiWl1sle! — gl lv - ul* 
+CiWiale — 2M 9 y — ut , 


where C, is an absolute constant and Cy = 21-9 C, (|g! Ip + |o! be 


4 Iterated Nonlinear Integral 


From Remark 2 we see that if W(t, x) = ed gi(t)x; and g(t) = fi(t), then 
d b 
J? Wat, p) = 5 J fi (t)dgi (t). We know that the multiple (iterated) integrals of 


i=l a 
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the form 


i Q1, 52, .... Sn)dg(si)dg(s2) -- dg(sn) 


are well-defined and have applications in expanding the solutions of differential 
equations (see [3]). What is the extension of the above iterated integrals to the non- 
linear integral? To simplify the presentation, we consider the case d — 1. General 
dimensions can be considered in a similar way with more complex notations. 

We introduce the following notation. Let 


Anab = { (S1; ---; Sn); a L S1 xs S Sy zb) 


be a simplex in R”. 


Definition 2 Let o : Ana,» — R be a continuous function. For a fixed sn € 
[a, b], we can consider ¢(-, Sn) as a function of n — 1 variables. Assume we can 
define dis m Q(S1,--+5Sn—1, Sn) W (dsi, -) --- W(dsn_1, +), which is a function 
of Sn, denoted by ¢n—1(Sn), then we define 


b 
J QG1, ttt, Sn) W(ds1, ‘) ian W (dsn, -) = J W (dsp, Pn-1(Sn)) . (28) 


ASS| S055 <b 


In the case W(t, x) = f (t)x, such iterated integrals have been studied in [3], where 
an important case is when ọ(s1, ..., Sn) = p (s1) for some function p of one variable. 
This means that q(s1, ..., s,) depends only on the first variable. This case appears 
in the remainder term when one expands the solution of a differential equation and 
can be dealt with in the following way. 


Let F1, F5, ..., Fa be jointly Hólder continuous functions on [a, b]?. More pre- 
cisely, for each i = 1,...,n, F; satisfies 
|F; (s1, t1) — Fi(so, t1) — Fi (s1, t2) + Fi (s2, £2) (29) 


À . 
< | Fillea:apls1 — s217 lt — t2, for all sj, s2, t1, t2 in [a, b]. 


We assume that tT +A > 1. 

Suppose that F is a function satisfying (29) with t + A > 1. The nonlinear 
integral f F(ds,s) can be defined analogously to Definition 1. Moreover, for a 
Holder continuous function p of order A, we set G(s, t) = p(t) F (s, t), itis easy to 
see that 
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|G(s1, t1) — G(s2, t1) — G(s1, 2) + G(ti, t2)| 
< loet) — eG FGi. ti) — FGa.0)] 
t |o(t2)] FG1, t1) — F(s2, t1) — FGs1, 0) + Fi, t2)| 
< (elll + llellos IF llzals1 — sal" lti — tl. 


Hence, the integration f p(s)F(ds, s) is well defined. In addition, it follows from 
Theorem 2 that the map t — f. : p (S) F (ds, s) is Hélder continuous of order T. 
We have then easily 


Proposition 6 Let p bea Hólder continuous function of order à. Under the condition 
(29) and t > 1/2, the iterated integral 


lab Pirie Fn) = / D G1) Fi (dsi, $1) F2(ds2, $2) +++ Fa(dSn, Sn) 


ASS] SLS Sb 
(30) 
is well defined. 


In the simplest case when p(s) = 1 and F;(s, t) = f(s) for alli = 1,...,n, the 
above integral becomes 


_ (Ff) — f@)" 
mere cae 


n: 


J df (s1) --- df (Sn) 


a<s1 < sub 
Therefore, one would expect that 


|b = a |?» 


ab(Fi,...,Fr)l < K (31) 


This estimate turns out to be true for (30). 


Theorem 4 Let Fi, ..., F, satisfy 29) and p be Hólder continuous with exponent A. 


AR oe | 


1 
We assume that p(a) = 0. Denote B = E and l, = = 4 plat 


A). Then, for any Yn < £n, there is a constant Cy, independent of a and b (but may 
depend on ys) such that 


Ia, CFA, .--, Fn)| S Culb — al^. (32) 
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Proof Denote 


HUE uu Fo) = J p Gi) Fi(dsi, s1) F2(ds2, 52) +++ Fk(dsk, sk). 


a<s1 X mskms 


Thus, we see by definition that 


s 


(CF, i mo J Fei (dr, I, ..., Fi). (33) 


a 


We prove this theorem by induction on n. When n = 1, the theorem follows straight- 
forwardly from (19) with the choice c — a. Indeed, we have posed < C[t — a|^** 
and |J) — 112 < Clt — sl". 

The passage from n to n + 1 follows from the application of (23)-(33) and this 
concludes the proof of the theorem. 


Remark 4 'The estimate of Theorem 4 also holds true for the iterated nonlin- 
ear Young integral OF, 2o P.) = [ee Fi(dsi, p(s1)) Fo(ds2, 82) --- 
k b k-1 
Fa(dsn, Sn), where I'QCK,..., F) = f? Feds, lis (Fi, ..., Fk—1)), and 


IR) = J Fids, p(s)). 
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A Weak Limit Theorem for Numerical 
Approximation of Brownian Semi-stationary 
Processes 


Mark Podolskij and Nopporn Thamrongrat 


Abstract In this paper we present a weak limit theorem for a numerical approxima- 
tion of Brownian semi-stationary processes studied in [14]. In the original work of 
[14] the authors propose to use Fourier transformation to embed a given one dimen- 
sional (Lévy) Brownian semi-stationary process into a two-parameter stochastic field. 
For the latter they use a simple iteration procedure and study the strong approxima- 
tion error of the resulting numerical scheme given that the volatility process is fully 
observed. In this work we present the corresponding weak limit theorem for the 
setting, where the volatility/drift process needs to be numerically simulated. In par- 
ticular, weak approximation errors for smooth test functions can be obtained from 
our asymptotic theory. 


Keywords Ambit fields - Brownian semi-stationary processes : Numerical 
schemes + Weak limit theorems 
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1 Introduction 


Recently, the mathematical theory of ambit fields has been intensively studied in 
the literature. Ambit fields is a class of spatio-temporal stochastic processes that has 
been originally introduced by Barndorff-Nielsen and Schmiegel in a series of papers 
[9-11] in the context of turbulence modelling, but which found manifold applications 
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in mathematical finance and biology among other sciences; see e.g. [4, 8]. In full 
generality they are described via the formula 


X(x) = u+ f 


A(x 


80s. 0E) L(ds, d) + I ,, Mr Das Dd (1) 


where ¢ typically denotes time while x gives the position in space. Furthermore, A; (x) 
and D, (x) are ambit sets, g and q are deterministic weight functions, o represents 
the volatility or intermittency field, a is a drift field and L denotes a Lévy basis. We 
recall that a Lévy basis L = (L(B) : B € S}, where Z is a ó-ring of an arbitrary 
non-empty set S such that there exists an increasing sequence of sets ($5) C - with 
UneN Sn = S, is an independently scattered random measure. 

An important purely temporal subclass of ambit fields are the so called Lévy 
(Brownian) semi-stationary processes, which are defined as 


t 


t 
oo ee n J gt- ad, Q) 


—oo —oo 


where now L is a two-sided one dimensional Lévy (Brownian) motion and the ambit 
sets are given via A; = D, = (—oo, t). The notion of a semi-stationary process refers 
to the fact that the process (X;);em is stationary whenever (ar, 0;);em is stationary and 
independent of (L;); em. In the past years stochastic analysis, probabilistic properties 
and statistical inference for Lévy semi-stationary processes have been studied in 
numerous papers. We refer to [2, 3, 6, 7, 11, 12, 15, 17, 20, 25] for the mathematical 
theory as well as to [5, 26] for a recent survey on theory of ambit fields and their 
applications. 

For practical applications in sciences numerical approximation of Lévy (Brown- 
ian) semi-stationary processes, or, more generally, of ambit fields, is an important 
issue. We remark that due to a moving average structure of a Lévy semi-stationary 
process (cf. (2)) there exists no simple iterative Euler type approximation scheme. For 
this reason the authors of [13, 14] have proposed two different embedding strategies 
to come up with a numerical simulation. The first idea is based on the embedding 
of a Lévy semi-stationary process into a certain two-parameter stochastic partial 
differential equation. The second one is based upon a Fourier method, which again 
interprets a given Lévy semi-stationary process as a realization of a two-parameter 
stochastic field. We refer to the PhD thesis of Eyjolfsson [18] for a detailed analysis 
of both methods and their applications to modeling energy markets. We would also 
like to mention a very recent work [16], which investigates numerical simulations of 
spatio-temporal ambit fields. 

The aim of this paper is to study the weak limit theory of the numerical scheme 
associated with the Fourier method proposed in [14, 18]. In the original work [14] 
the authors have discussed the strong approximation error (in the L? sense) of the 
numerical scheme for Lévy semi-stationary processes, where the volatility process 
(0;);em is assumed to be observed. We complement their study by analyzing the weak 
limit of the error process in the framework of Brownian semi-stationary processes, 
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where the drift and the volatility processes need to be numerically simulated. This 
obviously gives a more precise assessment of the numerical error associated with the 
Fourier method. 

The paper is organised as follows. In Sect. 2 we describe the Fourier approximation 
scheme for Brownian semi-stationary processes and present the main results on strong 
approximation error derived in [14, 18]. Section3 is devoted to a weak limit theorem 
associated with a slight modification of the Fourier method. 


2 Basic Assumptions and Fourier Approximation Scheme 


We start with a complete filtered probability space (42, F, (.7),cm, IP), on which all 
processes are defined. We consider a Brownian semi-stationary process of the form 


t t 
aoe. J g(t — s)a, W (ds) + ri GG ahaa (3) 


where g and q are deterministic kernels, (a;);cR and (o;);eg are adapted càdlàg 
processes, and W is a two sided Brownian motion. To guarantee the finiteness of the 
first integral appearing in (3), we assume throughout the paper that 


t 
D gt — 5)o2ds «oo almost surely (4) 


—oo 


for all t € IR. When (0;);eR is a square integrable stationary process, the above 
condition holds if g € L? (R30). The presence of the drift process (a;);¢jR will be 
essentially ignored in this section. 

Now, we describe the Fourier approximation method introduced in [14, 18] 
applied to the framework of Brownian semi-stationary processes. We start with the 
following assumptions on kernels involved in the description (3): 


Assumption (A): 


(i) The kernel functions g and q have bounded support contained in [0, t] for some 
r0. 
(ii) g,q € C(R»o). 


In some cases these conditions are rather restrictive. We will give remarks on them 
below. For any given à > 0, we define 


A(x) := g(xD and Ahi():- h(x) exp(alx]). (5) 


104 M. Podolskij and N. Thamrongrat 


Notice that g = h on [0, t]. We introduce the Fourier transform of A; via 
ho) = L hi (x) exp(—ixy)dx. 


Furthermore, if we assume that hi er! (IR), the inverse Fourier transform exists and 
we obtain the identity 


—A ae 
ines d f epad. 


Since the Fourier transform maps L! (R) functions into the space of continuous 
functions, we require that h € C(IR). This fact explains the Assumption (A)(ii) for 
the kernel function g. Since h is an even function, for a given number N € N, we 
deduce an approximation of h via 


bo Š k 
h(x) ~ hy (x) = expC-AlxD) (3 +> sett) (6) 
k=1 
with p 
"e mal) a (7) 


Obviously, the above approximation is an L?-projection onto the linear subspace 
generated by orthogonal functions {cos(kwx/t), sin(kz x / Fs hence we deal 
with a classical Fourier expansion of the function A (recall that the function h is even 
by definition, thus the sinus terms do not appear at (6)). Now, the basic idea of the 
numerical approximation method proposed in [14, 18] is based upon the following 
relationship: 


t t 
f g(t — s)os W (ds) © f hy (t — s)os W (ds) 


t N PEE 
= f exp(—A(t — s)) | i + Sbr edem) os W (ds) 
i k=1 
bo = NR kr 
= 5 Fru (t, 0) VeL a Tk (8) 


where the complex valued stochastic field £ a.u(t, y) is defined via 


t 
Fault, y) = exp{(—A + iy)(t — s)}os W (ds) (9) 


u 


A Weak Limit Theorem for Numerical Approximation ... 105 


andu € [t — v, t]. In a second step, fora ô > 0 small, we observe the approximation 


~ ~ t+ô 
Xj uí + ô, y) = exp{(—À + iy)ô} (Brut, y) «f exp{(—A + iy)(t — 9wa) 
t 


& exp{(—A + iy)ô} (Rault, Y) + o (Wis — W;)). (10) 


Hence, we obtain a simple iterative scheme for simulating the stochastic field 
£ X.u (t, y) in the variable t. Assume for the moment that the drift process a is zero and 
we wish to simulate the trajectory of X;,,..., X;,, given the information available 
at time fo. Then, the numerical simulation procedure is as follows: 


(a) Simulate the independent increments W, — W, , ~ (0, t; — ti—1) fori = 
1,..., M. 

(b) For each į = 1,..., M and k = O,..., N, simulate X; ,(t;, kr/t) from 
X5 u(ti-1, kr/t), Wi, — Wi,_, ando; , by using (10). 

(c) Simulate X, applying steps (a), (b) and (8) (with u = to). 


Let us explain some properties of the proposed numerical scheme. First of all, there 
are two approximation errors, where the first one (N scale) is coming from the Fourier 
transformation at (6) and the second one (M scale) is coming from the discretization 
error obtained at (10). 

It is important to understand the meaning of knowing the information about the 
involved processes up to time to. When the stochastic model for the process (o;);em 
is uncoupled with (X;);eg, then we may use u = t — r at (8). Indeed, in typical 
applications such as turbulence and finance this is the case: (0; );<jp is usually modeled 
viaa jump diffusion process driven by a Lévy process, which might be correlated with 
the Brownian motion W. However, when the process (X;);em is itself of a diffusion 
type, i.e. 


t t 
X,—u «f g(t — s)o (X,)W (ds) «f q(t — s)a(X;)ds 
=F f= 


T 


it is in general impossible to simulate a trajectory of (X;);em, since for each value t 
the knowledge of the path (X,,)ve(r—1,r) is required to compute X+. But, in case we 
do know the historical path, say, (X,)ue[-c,0], the simulation of values X;, t > 0, 
becomes possible. 

The main advantage of the numerical scheme described above is that it separates 
the simulation of the stochastic ingredients (o and W) and the approximation of 
the deterministic kernel g (or /). In other words, the stochastic field X r.u(t, y) is 
simulated via a simple recursive scheme without using the knowledge of g, while 
the kernel g is approximated via the Fourier transform at (6). This is in contrast to a 
straightforward discretization scheme 


0 


tj j-1 
f g(t — s)os W (ds) & V 8(tj — ti)on (Wigi — Wi). 
t i=l 
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This numerical property is useful when considering a whole family of kernel func- 
tions (gọ)oọco, since for any resulting model X;(0) only one realization of the sto- 
chastic field X X.u(t, y) needs to be simulated. This can be obviously useful for the 
simulation of parametric Brownian semi-stationary processes. 

We may now assess the strong approximation error of the proposed numerical 
scheme. We start with the analysis of the error associated with the approximation of 
the deterministic kernel g by the function hy. We assume for the moment that the 
volatility process (o;);eg is square integrable with bounded second moment. Then 
a straightforward computation (see e.g. [14, Eq. (4.5)]) implies that 


2 
t oo 
B[([ (es) - ii - nea | sc REC [Y yi) 
to ^ k=N+1 
(11) 


where C is a positive constant and the Fourier coefficients bg have been defined 
at (7). We remark that (1 — exp(—2A(t — to)})/A — 2(t — to) as A — O0, while 
(1 — exp( -2A (t — t9)D/X ~ àT! as A — oo. Thus, it is preferable to choose the 
parameter à > 0 large. 


Remark 1 A standard model for the kernel function g in the context of turbulence 
is given via 7 
g(x) = x* exp(—Ax) 


with A > 0 anda > —1/2. Obviously, this function has unbounded support and for 
the values œ € (—1/2, 0) itis also discontinuous at 0, hence it violates the statement 
of the Assumption (A). However, one can easily construct an approximating function 
gI , which coincides with g on the interval [¢, T] and satisfies the Assumption (A). 
Assuming again the boundedness of the second moment of the process (o;);em, the 
approximation error is controlled via 


: 2 
| (tee —5) = gf (¢—s)}o,W(ds)) | E NR 


Such error can be made arbitrary small by choosing & small and T large. Clearly, 
this is a rather general approach, which is not particularly related to a given class 
of kernel functions g. In a second step one would apply the Fourier approxima- 
tion method described above to the function gl . At his stage it is important to note 
that the parameter A > 0 introduced at (5) is naturally restricted through the con- 
dition A < À; otherwise the kernel 4, would have an explosive behaviour at oo. 
Thus, the approximation error discussed at (11) cannot be made arbitrarily small 
in A. 


Remark 2 The Fourier coefficients bg can be further approximated under stronger 
conditions on the function h, which helps to obtain an explicit bound at (11). More 
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specifically, when h € C?" (R) and hU/ P (r) = O forall j = 1,..., n, then it holds 
that 
Ib] x CK”. 


This follows by a repeated application of integration by parts formula (see [14, 
Proposition 4.1] for a detailed exposition). In fact, the original work [14] defines 
another type of smooth interpolation functions h, rather than the mere identity h(x) = 
g((x|), to achieve that the relationship h^ (r) = 0 holds for all j = 1, ..., n and 
some n € N. 


Now, let us turn our attention to the discretization error introduced at (10). We assume 
that to <--- < ty is an equidistant grid with t; — tj: = At. According to (10) the 
random variable 


J 
nj(y) = > expt(-A + iy) + 1 Ados (Wa — Wii) (12) 
i=l 


is an approximation of X 1,19 (tj, Y) for any y € R whenever the drift process a is 
assumed to be absent. When (o;);em is a weak sense stationary process, a straight- 
forward computation proves that 


BR, (5. y) — 0; PI x Cj — t) (02 + yA? + Ellon — os). 13) 


We refer to [14, Lemma 4.2] for a detailed proof. 


Remark 3 Assume that the process (0;);em is a continuous stationary Itó semimartin- 
gale, i.e. 
do; = a dt + 9;dB;, 


where B is a Brownian motion and (à;);em, (9;);em are stochastic processes with 
bounded second moment. Then the Itó isometry implies that 


illon — opl] € CAt. 


Hence, in this setting At becomes the dominating term in the approximation error 
(13). 


Combining the estimates at (11) and (13), we obtain the strong approximation error 
of the proposed Fourier method, which is the main result of [14] (see Propositions 
4.] and 4.3 therein). 


Proposition 1 Letto < --- < ty bean equidistant grid with t; — tjj = At. Assume 
that condition (A) holds and (o;),eg is a weak sense stationary process. Then the L? 
approximation error associated with the Fourier type numerical scheme is given via 
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) n (t; — 5)e was- (2 +> yr (14) 
5 a SM j s 5j = kj " 
] — exp{—-2A t- 2) y & 2 
<C |bx| 
(Senn È e) 


2( lbol z 2 2 T. x 2 2 
t0; = 10) (I + Dal) Cao? + C (kb) can 
k=1 


k=1 
N 
Ibo 2 
+(5 +2, ib) o, - I} 


for a positive constant C. 


3 A Weak Limit Theorem for the Fourier 
Approximation Scheme 


As we mentioned earlier, the Fourier approximation scheme investigated in [14, 18] 
basically ignored the need of simulating the volatility process (o;);em in practical 
applications (the same holds for the drift process (a;);em). As in the previous section 
we fix a time to and assume the knowledge of all processes involved up to that 
time. Here we propose a numerical scheme for simulating the path (X;);ep;g, r] for 
a given terminal time T > fo, which is a slightly modified version of the original 
Fourier approach. We recall the imposed condition (A), in particular, the weight 
functions g and q are assumed to have bounded support contained in [0, t]. First 
of all, we assume that we have càdlàg estimators (a/^ , o); 7, of the stochastic 
process (ar, 9;);e(;i, T] and the convergence rate vy — oo as M — oo such that the 
following functional stable convergence holds: 


"T (a™ a,o™ o) ^5  =(U!,U2) onD^Q[t,T], (15) 


where the convergence is on the space of bivariate càdlàg functions defined on [to, T] 
equipped with the Skorohod topology D? ([to, T ]). Let us briefly recall the notion of 
stable convergence, which is originally due to Rényi [27]. We say that a sequence of 
random variables Y" with values in a Polish space (E, &) converges stably in law 
to Y, where Y is defined on an extension (42^, F’, P^) of the original probability 
(2, ¥, IP) if and only if 


lim E[f(Y)Z] = E'[f(Y)Z] 


noo 


A Weak Limit Theorem for Numerical Approximation ... 109 


for any bounded and continuous function f and any bounded .¥ -measurable random 


variable Z. In this case we write (Y" E: Y). In the following we will deal with the 
space of càdlàg processes equipped with the Skorohod topology or with the space 
of continuous processes equipped with the uniform topology. We refer to [1, 23] or 
[27] for a detailed study of stable convergence. Note that stable convergence is a 
stronger mode of convergence than weak convergence, but it is weaker than uniform 
convergence in probability. 

We remark that the estimators (aM reiro, T] and (o, M )ref,7] might have a different 
effective convergence rate. In this case we will have either U; = 0 or U2 = 0. 

Now, we basically follow the Fourier type approach, which refers to (5) and the 
definition of the function hi, described in the previous section, but we replace the 
Fourier transform approximation proposed at (6) by a Riemann sum approximation. 
More specifically, we introduce the approximation 


—AÀ EN 
n(x) = ZEI [ ja) expixy)dy 


Ai hy) — MD » hi (x) cos (=) ; (16) 


k=0 


where cy is a sequence of numbers in N satisfying cy/N — oo as N — oo. In 
the following we will also assume that the sequence cy additionally satisfies the 
condition 


oo 
vf Ih;(y)dy — 0 asN — oco. (17) 
cn/N 


Clearly, such a sequence exists, since h; € L'(R). When introducing the approxima- 
tion at (16), we obviously obtain two types of error: The Riemann sum approximation 
error and tail approximation error. Condition (17) guarantees that the Riemann sum 
approximation error will dominate. 


Remark 4 Under some stronger conditions the tail integral at (17) can be bounded 
from above explicitly. Assume that € C ?([—r, t]) such that h’(t) = 0 (cf. Remark 
2). Then a repeated application of integration by parts formula implies the identity 


T 


fit) f h. G0 cosa) = —— f h} (x) cos(yx)dx 
R T 


for any y > 0. Thus, for any u > 0, we deduce the inequality 


oo P oo 
f Ihi @)ldy < cis f y ?dy < CUA uw. 
u u 


Hence, condition (17) holds whenever N? /cN > Qas N > oo. 
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Remark 5 We remark that the Fourier transform used at (6) comes from the L? 
theory. Thus, in contrast to the L?-distance ||h — hy || 12, the limiting behaviour of a 
standardized version of A (x) — hy (x) is difficult to study pointwise. This is precisely 
the reason why we use the Riemann sum approximation instead, for which we will 
show the convergence of N (h(x) — hy (x)). 

If one can freely choose the simulation rates N and M, the Fourier transform of 
(6) is numerically more preferable. According to the estimate (11) and the upper 
bound for the Fourier coefficient of Remark 2 applied for n = 1, we readily deduce 
the rate NT! for the L?-error approximation connected to (6). On the other hand, the 
effective sample size of the Riemann approximation at (16) is cy. In the setting of 
the previous remark the overall Riemann approximation error is max(N~!, N/cy). 
Recalling that cy/N — oo, the obtained rate is definitely slower than the one 
associated with Fourier approximation proposed at (6). 

Nevertheless, as our aim is to precisely determine the asymptotics associated with 
the N scale, we will discuss the Riemann approximation approach in the sequel. A 
statement about the Fourier transform (6) will be presented in Remark 8. 


Now, we essentially proceed as in the steps (8)-(10). First of all, it holds that 


t t 
f st 9o f hy (t — s)os W (ds) 


u u 


a] exp(—A(t — s) [25 uer >) c, W (ds) 
ü k=0 w 


CN PEEN k 
= Re S byX, ut, N^ (18) 
k=0 


where the complex valued stochastic field x a.u(t, y) is defined at (9) and b, = 
h, (k/ N)/ (1t N). In a second step, for 6 > 0, we obtain the approximation 


+d 


. 
Xj u(t +4, y) = exp{(—A + iy)8] (Bac y)+ f exp{(—A + iy)(t — sje, (as)) 
$ 


t+ô 
& exp{(—A + iy)d} (Bat yt 1 exp{(—A + iy)(t — ie was) : 
t 


(19) 
When the estimator o is assumed to be constant on intervals [si-1, 5$), 1. = 
1,..., M, the last integral at (19) can be easily simulated (cf. (10)). We remark 
that this approximation procedure slightly differs from (10) as now we leave the 
exponential term unchanged. 
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In summary, given that the information up to time fo is available, we arrive at the 
simulated value 


t 


t 
x^ = hy - 2o Wels) + | q(t — s)a" ds (20) 
to 


to 


of the random variable 


t 


t 
der «t = sowas) + f q(t — s)asds. (21) 
to 


to 


Note that the drift part of the Brownian semi-stationary process X is estimated in a 
direct manner, although other methods similar to the treatment of the Brownian part 
are possible. Now, we wish to study the asymptotic theory for the approximation 
error X "i M - X o, Our first result analyzes the limiting behaviour of the function 
N (h(x) — hy (x)). 
Lemma 1 Define the function Yy (x) := N (h(x) — hw (x)). Let us assume that the 
condition I p 

yh) e L'(R, — y?hi() e L'() Q2) 


holds. Then, under Assumption (A), (17) and (22), it holds that 


h, (0) 


vna) > va) eee 


exp(—A|x|) | as N — oo (23) 


for any x € R. Furthermore, it holds that 


sup IWn(x)| x € 
NeN, xe[0,T] 


for any T > 0. 


Proof First, we recall a well known result from Fourier analysis (see e.g. 
[19, Theorem 8.22]): The condition (22) implies that 


h, € LIR), RM eLl(g. (24) 
Now, observe the decomposition 


_ Nexp(-A|xp ve. fD k 
yna) = SAPARD > n " (so z ecd) dy 
, Napad [7 


ui (cy+1)/N 


Nexp(—Alx|) < ‘cee ( k ) 
a 0) — (>) ) dy +001), 
= 2. a KO) — kk C) J dy Ho) 


Kx (y)dy 
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where ky(y) = hi (y) cos(yx) and the approximation follows by the inequality 
lc C)| < |Aa(y)| and condition (17). Let us denote by «/ (y) the derivative of 
Kx (y) with respect to y. Since «/(-), «/(*) € L!(R=>0) because of (24), we deduce 
that 


N =X CN (k+1)/N k k 
pa = VOY | «Gp (»- s )e co 


k=0 "K/N N 
exp(—Alx|) xe ,_ k 
E = 1 
2nN 2.) +00) 


"x oo 
> D | k.(y)dy | as N — oo. 
27 0 E 
But, since hy vanishes at infinity, we readily obtain that 
oo ~ 
| Kk, Q)dy = —h; (0). 
0 


In order to prove the second assertion of the lemma, we observe the inequality 


exp(—Alx|) < ee 
k=0 K/N 


wee Hanoy +N | ficta 


where £y. N Cy) is a certain value with Zi. w(y) € (k/N, y). Clearly, the second term 
in the above approximation is bounded in N, since it converges to 0. On the other 
hand, we have that [&/ (y)| < IxlI/ra @)| + Ik, Cy)], and since hy, À € L'(Rz9), we 
readily deduce that 


sup |Wn(x)| s C. 
NEN, xe[0,T] 


This completes the proof of the lemma. 


At this stage we need a further condition on the kernel function g to prove tightness 
later. 


Assumption (B): 
(i) The kernel function g has the form 


g) = x* f(x) 


for some o > 0 and function f satisfying f(0) Æ 0. 
adi) fec l (R>o) has bounded support contained in [0, t]. 
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Notice that the assumptiona > (isin accordance with the condition (A)(ii). Assump- 
tion (B) implies the following approximation result: 


a=0 


a>0O 


: à Cât 
[ IgG +ô) — gœ) dx < | ogmin(4,4a+1) (25) 


for 6 € [0, T]. The case a = O is trivial, while the other one follows along the lines 
of the proof of [20, Lemma 4.1]. As a matter of fact, we also require a good estimate 
of the left side of (25) when the kernel g is replaced by the function wy defined in 
Lemma 1. In the following we will assume that 


1 
sup | IYn (x +8) — Wn æ) dx < C8! (26) 
NcN40 

for some ¢ > 0 and ô € [0, T]. 


Remark 6 Unfortunately, we have not been able to show the statement of (26) under 
the mere assumption of, say, condition (B). Obviously, as in the case of function g, 
condition (26) would hold if 


VN (x) = x” fy (x), 
where fy € C HR.) with uniformly bounded derivative in N € N and x ina 
compact interval. We can prove condition (26) explicitly when the function g is 


differentiable. Assume that yh, (y), yh), (ye L! (Ro) and cy is chosen in such a 
way that the condition 


oo 
n f lyhi@)lddy > 0 as N — oo 
cN/N 


is satisfied. As in Lemma 1 we conclude that (0, 9, (y)| < m (y)|+ lyxha, (y)|+ 
|yh},(y)|) and, as in the proof of Lemma 1, we deduce that 


oo oo 
wp wols c (wv f bi onyx f lh, ()Idy 
xe[0,T] cn/N cn/N 


CN (k+1)/N CN 
«Mj yc Gy | 
k=0 


tco 4 k/N UN 


(k+1)/N 


cs Gron) 


for certain values Zi. (y), Tr, N (y) in the interval (k/N, y). Then, due to our inte- 
grability conditions, we obtain 


sup lv (x)] « oo. 
NEN, xe[0,T] 
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Moreover, condition (26) is trivially satisfied due to mean value theorem. However, 
showing (26) under Assumption (B) seems to be a much harder problem for o € 
(0, 1). 


The next result is the main theorem of our paper. 


Theorem 1 Assume that conditions (A), (B), (17), (22) and (26) hold, and the 
processes (0;) req, r] and (o) etn. 7) has finite fourth moment with SUP;er. T] v[0;] 
< oo and SUP; ef),7] SUPMeN [(0,4)^] < oo. We also assume that the process 


U~ = vu (o4 — or) satisfies 


sup sup =U" )*] « oo. (27) 
te[to, T] MEN 


Then we obtain the decomposition 


x = eua nM 


such that 
Tm hj) [ 
NA’ = A= = exp(—A(- — s))osW (ds) as N,M —> œ, (28) 
to 


p 
where ucp convergence means that sup; 1,7 |AN — A;| — 0, and 


ds 
wn” 55 B= | 


to 


«coU was + | ac- syusas as M — oo, (29) 


to 


where the stable convergence holds on the space C ([to, T ]) equipped with the uniform 
topology. 


Proof We start with the decomposition X H MX 0 — AN.M 4 BM where 


t 
AP" = [ (iut - 9 - et no? wa. 
to 


0 


t t 
RY =) g(t — s)(o — o,) W (ds) «f q(t —s){a™ — as}ds. 
to t 


We begin by proving the stable convergence in (29). Let us first recall a classical result 
about weak convergence of semimartingales (see [23, Theorem VI.6.22] or [24]): 


Let (Y7),e(4, r] be a sequence of càdlàg processes such that Y" = Y on D([to, T]) 
equipped with the Skorohod topology. Then we obtain the weak convergence 


L Y? W (ds) => f Y,W(ds) on C([to, T]) 
to to 
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equipped with the uniform topology. This theorem is an easy version of the general 
result, since the integrator W does not depend on n and hence automatically fulfills the 
P-UT property. The stable nature ofthe aforementioned weak convergence follows by 
joint convergence (Io Y? W (ds), Y", W) => (Jr Y,W (ds), Y, W) (cf. [24]). Hence, 
we deduce that 


] wes) YW(ds) on C(t, TI) (30) 
0 0 


equipped with the uniform topology. It is important to note that this result can not 
be directly applied to the process BY , since this process is not a semimartingale 
in general. Thus, we will prove the stable convergence (29) by showing the stable 
convergence of finite dimensional distributions and tightness. 

We fix u1,...,ux € [to, T]. Due to the condition (15), the finite dimensional 
version of (30) and continuous mapping theorem for stable convergence, we conclude 
the joint stable convergence 


({ow y g(uj — s)(o = os}W(as)} vu f qo — sa” — asjas) 
to Jj=l,...,k to 


d ({ guj- suwa») | "i qC— 22) (31) 
f 2 » 


EMEN LL Ce 


as M — oo. Here we remark that the stable convergence for the second compo- 
nent indeed holds, since the mapping F : C([to, t]) x D([to, T) — C([to, T], 
F(q,a) = Jo q(- — s)asds is continuous. Hence, we are left with proving tightness 


for the first component of the process B~ . We fix u,t € [to, T] with t > u and 
observe the decomposition 


t u 
VM (/ g(t — s)to — o3W (ds) — ji g(u — 5)(oM — awas) 


0 to 


t 
= (/ g(t — so —0,)W(ds) 


u 


+ / ditte gür- 5)Ho? — gawas) = RQ (ru) + ROW, u). 
to 


Using Burkholder and Cauchy-Schwarz inequalities and (27), we have 


t 
URO Gas CE — w | lg(t — s)|^ds. 


Thus, we conclude that 


RO t, w4] < CU uy (32) 
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Now, using the same methods we conclude that 


Uu . 
ERO Cw e € f let — s) — glu — s)|*ds < C(t — u) ™™ état (33) 
to 


where we used the inequality (25). Thus, applying (32), (33) and the Kolmogorov’s 
tightness criteria, we deduce the tightness of the first component of the process Be : 
This completes the proof of (29). 

Now, we show the pointwise convergence at (28). Recalling the notation from 
(23), we need to show that 


t 
f (V (t —5) — yt- sjo W(ds) > 0 as N, M — oo 
to 


for a fixed t. The Itô isometry immediately implies that 


sup E | 
MeN 


t 2 t 
f Wort — s) — we = 5))o Wds)| | < cf (YNG — 5) — Y — s) ds 
to to 


—0 as N — oo, 


which follows by Lemma 1 and the dominated convergence theorem. Hence, we 
obtain pointwise convergence at (28). Since the limiting process A is continuous, we 
now need to show that 


sup E[N4(AN™ — ANM)*] < C(t — u)!t 
N,MeN 


for fy < u < t, to conclude ucp convergence from pointwise convergence in proba- 
bility. Applying the same methods as in (32), (33) we deduce the inequality 


sup E[N*(Aj^" — ayy‘) 
N,MeN 


t u 

sc(e-wf vt — las + | uwa — 9) — Vu — otas) 
u to 

< C(t- u)! t, 


which follows by Lemma 1 and condition (26). This completes the proof of 
Theorem 1. 


Remark 7 We remark that the stronger conditions (B) and (26) are not required to 
prove the finite dimensional version of convergence (28) and (29). 


Theorem 1 immediately applies to the weak approximation error analysis. Assume for 
simplicity that M = M(N) is chosen such that vm /N — 1, so that the Riemann sum 
approximation error and the simulation error from (15) are balanced. We consider a 
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bounded test function o € C 1 (R) with bounded derivative. The mean value theorem 
implies the identity 


o(XN™) — o(x9) = g'n y) (XI — x0, 


, i P 
where £y. m is arandom value between X? and AM withy, y —> X? as N — oo. 


By properties of stable convergence we deduce that (Ey m, N(X y Ma x?) ELS 


(X a A; + B;). Hence, given the existence of the involved expectations, we conclude 
that 


TEXN] — E[g(X9)] = N7le'to (X9) (A; + B)]--o(N-). 64 


(Recall that the limit A; + B; is defined on the extended probability space 
(2', F', P^). 


Remark 8 The results of Theorem 1 may also apply to the original Fourier approxi- 
mated method proposed in [14, 18]. Let us keep the notation of this section and still 
denote the approximated value of X H by X y M, Recalling the result of (11) (see also 
Remark 2) and assuming that M = M(N) is chosen such that 20 3d Ib | « vm, 
we readily deduce that 


ds 
mie — x9) £5 B. 


Remark 9 The results of Theorem 1 might transfer to the case of Lévy semi- 
stationary processes 


t t 
X,— uo 1 g(t — s)o, L(ds) + / q(t — s)asds 
oo oo 


under suitable moment assumptions on the driving Lévy motion L (cf. [14]). How- 
ever, when L is e.g. a B-stable process with 6 € (0, 2), it seems to be much harder 
to access the weak limit of the approximation error. 


In the following we will present some examples of convergence at (15) to highlight 
the most prominent results. For simplicity we assume that a = 0 in all cases. 


Example I Let us consider a continuous diffusion model for the volatility process 
c,le. 
do; = a(o;)dt + v(o;)dB;, Ot = XQ, 


where B is a Brownian motion possibly correlated with W. We consider an equidistant 
partition f = so < sj < +- < sy = T of the interval [to, T] and define the 


continuous Euler approximation of o; via 


a = a + aloa — sk) + V(o, (B; — Bs,), t € [sk, Sk41]. 
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When the functions a and v are assumed to be globally Lipschitz and continuously 
differentiable, it holds that 


dst 
VM(o” —o) 2S U?  onC([to, T]), 
where U? is the unique solution of the stochastic differential equation 
5 25 Y 
dU? = à'(o;) U2dt + V (o;)U2 dB, — yg naw. 


where W' is a new Brownian motion independent of .Z. We refer to [22, Theorem 
1.2] for a detailed treatment of this result. 


Example 2 Let us now consider a discontinuous diffusion model for the volatility 
process o, i.e. 
do; = v(o; )dL;, Om = X0, 


where L is a purely discontinuous Lévy process. In this framework we study the 
discretized Euler scheme given via 


M ~ M 
odo v(o,. VL) — Ls). k=0,...,M—1. 
We define the process UM = ofin jm — OltM\/M- In [21] several classes of Lévy 
processes L has been studied. For the sake of exposition we demonstrate the case of a 


symmetric f-stable Lévy process L with B € (0, 2). Let us assume that v € C 3(R). 
Then, it holds that 


(M/log(M) PUM P", U? on D([to, TI), 


where U? is the unique solution of the linear equation 

dU? = V (o; )U? dL, — VV (o; .)dL, 
and L’ is another symmetric f-stable Lévy process (with certain scaling para- 
meter) independent of F. We note that this result does not directly correspond 


to our condition (15) as the discretized process op m]/m is used in the definition 
of UM. 
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Non-elliptic SPDEs and Ambit Fields: 
Existence of Densities 


Marta Sanz-Solé and André Süß 


Abstract Relying on the method developed in [11], we prove the existence of a 
density for two different examples of random fields indexed by (t, x) € (0, T] x R^. 
The first example consists of SPDEs with Lipschitz continuous coefficients driven 
by a Gaussian noise white in time and with a stationary spatial covariance, in the 
setting of [9]. The density exists on the set where the nonlinearity o of the noise does 
not vanish. This complements the results in [20] where o is assumed to be bounded 
away from zero. The second example is an ambit field with a stochastic integral term 
having as integrator a Lévy basis of pure-jump, stable-like type. 


Keywords Stochastic partial differential equations * Stochastic wave equation - 
Ambit fields * Densities 


1 Introduction 


Malliavin calculus has proved to be a powerful tool for the study of questions con- 
cerning the probability laws of random vectors, ranging from its very existence to the 
study of their properties and applications. Malliavin's probabilistic proof of Hórman- 
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der's hypoellipticity theorem for differential operators in quadratic form provided the 
existence of an infinitely differentiable density with respect to the Lebesgue measure 
on IR" for the law at a fixed time t > 0 of the solution to a stochastic differential 
equation (SDE) on R” driven by a multi-dimension Brownian motion. The classical 
Malliavin's criterion for existence and regularity of densities (see, e.g. [13]) requires 
strong regularity of the random vector X under consideration. In fact, X should be 
in the space D, meaning that it belongs to Sobolev type spaces of any degree. 
As a consequence, many interesting examples are out of the range of the theory, for 
example, SDE with Hólder continuous coefficients, and others that will be mentioned 
throughout this introduction. 

Recently, there have been several attempts to develop techniques to prove exis- 
tence of density, under weaker regularity conditions than in the Malliavin's theory, 
but providing much less information on the properties of the density. The idea is to 
avoid applying integration by parts, and use instead some approximation procedures. 
A pioneer work in this direction is [12], where the random vector X is compared 
with a good approximation X^ whose law is known. The proposal of the random 
vector X° is inspired by Euler numerical approximations and the comparison is done 
through their respective Fourier transforms. The method is illustrated with several 
one-dimensional examples of stochastic equations, all of them having in common 
that the diffusion coefficient is Hélder continuous and the drift term is a measurable 
function: SDEs, including cases of random coefficients, a stochastic heat equation 
with Neumann boundary conditions, and a SDE driven by a Lévy process. 

With a similar motivation, and relying also on the idea of approximation, 
A. Debussche and M. Romito prove a useful criterion for the existence of den- 
sity of random vectors, see [11]. In comparison with [12], the result is formulated in 
an abstract form, it applies to multidimensional random vectors and provides addi- 
tionally information on the space where the density lives. The precise statement is 
given in Lemma 1. As an illustration of the method, [11] considers finite dimensional 
functionals of the solutions of the stochastic Navier-Stokes equations in dimension 3, 
and in [10] SDEs driven by stable-like Lévy processes with Hólder continuous coef- 
ficients. A similar methodology has been applied in [1, 2, 4]. The more recent work 
[3] applies interpolation arguments on Orlicz spaces to obtain absolute continuity 
results of finite measures. Variants of the criteria provide different types of proper- 
ties of the density. The results are illustrated by diffusion processes with log-Hólder 
coefficients and piecewise deterministic Markov processes. 

Some of the methods developed in the references mentioned so far are also well- 
suited to the analysis of stochastic partial differential equations (SPDEs) defined by 
non-smooth differential operators. Indeed, consider a class of SPDEs defined by 


Lu(t, x) = b(u(t, x)) + o (u(t, x)) F(t, x), (t, x) € (0, T] x RÍ, (1) 


with constant initial conditions, where L denotes a linear differential operator, 
c, b:R — R, and F is a Gaussian noise, white in time with some spatial corre- 
lation (see Sect. 2 for the description of F). Under some set of assumptions, [20, 
Theorem 2.1] establishes the existence of density for the random field solution of (1) 
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at any point (t, x) € (0, T] x R, and also that the density belongs to some Besov 
space. The theorem applies for example to the stochastic wave equation in any spatial 
dimensions d > 1. 

The purpose of this paper is to further illustrate the range of applications of 
Lemma 1 with two more examples. The first one is presented in the next Sect.2 and 
complements the results of [20]. In comparison with this reference, here we are able 
to remove the strong ellipticity property on the function c , which is crucial in most of 
the applications of Malliavin calculus to SPDEs (see [19]), but the class of differential 
operators L is more restrictive. Nevertheless, Theorem 1 below applies for example 
to the stochastic heat equation in any spatial dimension and to the stochastic wave 
equation with d < 3. For the latter example, if o, b are smooth functions and o is 
bounded away from zero, existence and regularity of the density of u(t, x) has been 
established in [16, 17]. 

The second example, developed in Sect. 3, refers to ambit fields driven by a class 
of Lévy bases (see 14). Originally introduced in [5] in the context of modeling 
turbulence, ambit fields are stochastic processes indexed by time and space that are 
becoming popular and useful for the applications in mathematical finance among 
others. The expression (14) has some similarities with the mild formulation of (1) 
(see 3) and can also be seen as an infinite dimensional extension of SDEs driven by 
Lévy processes. We are not aware of previous results on densities of ambit fields. 

We end this introduction by quoting the definition of the Besov spaces relevant 
for this article as well as the existence of density criterion by [11]. 

The spaces By oo 5 > 0, can be defined as follows. Let fm — R.Forx,h € R4 


set (ALS = f(x +h) — f(x). Then, for any n € N, n > 2, let 
ARP) = (A, (AR! A) @) = ze b iG rece im. 


For any 0 « s « n, we define the norm 


Iflg: n = Ifl + i p al As flu. 


It can be proved that for two distinct n, n' > s the norms obtained using n or n’ are 
equivalent. Then we define B1 „ to be the set of L -functions with || f || Bia) < O0. 
We refer the reader to [22] for. inore details. 

In the following, we denote by 167 the set of bounded Hölder continuous functions 
of degree a. The next Lemma establishes the criterion on existence of densities that 
we will apply in our examples. 
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Lemma 1 Let « be a finite nonnegative measure. Assume that there exist 0 < a < 
a < l, n € Nanda constant C, such that for all à € €*, and all h € R with 
Ih| x 1, 


| f Ano )k (dy) 
R 


< C, |o lee lhl". (2) 


Then x has a density with respect to the Lebesgue measure, and this density belongs 
to the Besov space BY SS R). 


2 Nonelliptic Diffusion Coefficients 


In this section we deal with SPDEs without the classical ellipticity assumption on the 
coefficient o, i.e. inf, cpa |o (x)| > c > 0. In the different context of SDEs driven 
by a Lévy process, this situation was considered in [10, Theorem 1.1], assuming in 
addition that o is bounded. Here, we will deal with SPDEs in the setting of [9] with 
not necessarily bounded coefficients o. Therefore, the results will apply in particular 
to Anderson’s type SPDEs (o (x) = Ax, A Æ 0). 

We consider the class of SPDEs defined by (1), with constant initial conditions, 
where L denotes a linear differential operator, and c, b:R — R. In the definition 
above, F is a Gaussian noise, white in time with some spatial correlation. 

Consider the space of Schwartz functions on R4, denoted by Z (Rd ), endowed 
with the following inner product 


Ware | ay | rdx (y) v Cy — x), 
Rd Rd 


where I" is a nonnegative and nonnegative definite tempered measure. Using the 
Fourier transform we can rewrite this inner product as 


(ve - f, U(dE)\F Q(5).7 YE), 


where u is a nonnegative definite tempered measure with Fu = F. Let #7: = 
CZ, (C, joa ner and #7: = L?([0, T]; #). It can be proved that F is an iso- 
normal Wiener process on #7. 

Let A denote the fundamental solution to Lu = 0 and assume that A is either a 
function or a non-negative measure of the form A(t, dy)dt such that 


sup A(t, R^) «€ Cg « oo. 
te[0, 7] 
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We consider 
t 
u(t, x) a 1 A(t — s, x — y)o (u(s, y)) M (ds, dy) 
0 JRI 
t 
+ f n A(t — s, x — y)b(u(s, y))dydy, (3) 
0 JRI 


as the integral formulation of (1), where M is the martingale measure generated by 
F. In order for the stochastic integral in the previous equation to be well-defined, we 
need to assume that 


T 
J ds 2 p(dé)|FA(s\(E)22 < +00. (4) 
0 R4 


According to [9, Theorem 13] (see also [23]), equation (3) has a unique random field 
solution (u(t, x); (t, x) € [0, T] x R4) which has a spatially stationary law (this is 
a consequence of the S-property in [9]), and for all p > 2 


sup z [lut x)|? ] « oo. 
(t.x)e[0, T] xiRZ 


We will prove the following result on the existence of a density. 


Theorem 1 Fix T > 0. Assume that for all t € [0, T], A(t) is a function or a non- 
negative distribution such that (4) holds and SUP;e[g. T] A(t, R7) < oo. Assume fur- 
thermore that o and b are Lipschitz continuous functions. Moreover, we assume that 


t 
ct’ <f as | u(dé)|F AGYG)? < C™, 
0 R4 
t 
f ds| F A(s)(0)|? < Cr”, 
0 


for some y, yi. yo > 0 and positive constants c and C. Suppose also that there exists 
ô > 0 such that 


E[lu(t, 0) — u(s, 0)?] < Cir — si?, (5) 


for any s,t € [0, T] and some constant C > 0, and that 


min{yi, y2] + ô 
= —— — >. 
y 
Fix (t, x) € (OG, T] x R’. Then, the probability law of u(t, x) has a density f on 
the set (y € R; o(y) Æ 0}. In addition, there exists n > 1 such that the function 
y e |o (y)|" f (y) belongs to the Besov space BP. with P e€(Q0,y —1). 


1,00? 
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Proof The existence, uniqueness and stationarity of the solution u is guaranteed by 
[9, Theorem 13]. We will apply Lemma 1 to the law of u(t, x) at x = 0. Since 
the solution u is stationary in space, this is enough for our purposes. Consider the 
measure 


(dy) = le GI" (P o utt, 07*) (dy), 
We define the following approximation of u(t, 0). Let for0 < £ «t 
t 
u(t,0) — U*(t,0) + o(u(t — s, o» | i At —s,—-y)M(ds.dy), ©) 
t—e JRI 
where 
t—& 
U*(,0) = n n Att — 8, —yo Guts, Ms, dy) 
0 RI 
{=E 
" n f AG — s, y)b(us, y))dyds 
0 R4 
t 
+ b(u(t — €, o» f n A(t — s, —y)dyds. 
t—& JRI 


Applying the triangular inequality, we have 


| [, ate) = [E[lo ut, 0))|" Aho (uc, 0))]| 
< [|E[(o u(t, 0)" — lo (ult — e, 0) ) Az (u(t, 0))]| 
+ |E[lo (u(t — e, 0))|" (Apo (u(t, 0)) — Ajo Qi (t, 0)))]| 
+ |E[lo(u(t — £, 0l" Azo u° (t, 0)) |. (7) 


Remember that Ane llee <Ch I le - Consequently, 


ARLI = 1A; $0) Ap PE +A) < Crillbllee IhI", 


Using this fact, the first term on the right-hand side of the inequality in (7) can be 
bounded as follows: 


|E[ (lo (u(t, 0)" — |o (u(t — £, 01 ARG (u(t, 0))]] 
< Cullbllee II E[]lo (u(t, 0)" — lo ut — £, 0)" |]. (8) 


Apply the equality x" — y" = (x — y)(x"^-1 + x"2y +--+ xy"? + y^-1) along 
with the Lipschitz continuity of o and Hólder's inequality, to obtain 
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E[|lo (u(t, 0)” — lo (u(t — £, 0)" |] 
n—1 


< efjo, 0)) — o (ult — £, 0))| X` lo ut, Dllo ut, o=] 


j=0 
24 1/2 = j Y 
< C(E[|uc. 0) — u(t — e, 0)| )) (=|( > lo (u(t, 0))/ Jo (u(t, o=) ) 
j=0 
< Cn (Ellu, 0) — u(t — &, of] ^ 
< Ce, = 


where we have used that o has linear growth, also that u(t, 0) has finite moments of 
any order and (5). Thus, 


?[(o (u(t, 0)" — lo Qi — £, 0)) |") ARG (u(t, 0))]| x Cnllbllee II" e". (10) 


With similar arguments, 


|E[lo Qu — e, O)" (Az our, 0) — AT Qi .0))]| 

< C, lé lee [Io (u(t — e, 0I" ut, 0) — u(t, 0)1*] 

< C, llle (E[u, 0) — u° C, 0) ]))"" Ello ut — e, 0) 27] 7 

< Culldligee?*/? (gi(e) + 220)", (11) 


where in the last inequality we have used the upper bound stated in [20, Lemma 2.5]. 
It is very easy to adapt the proof of this lemma to the context of this section. Note 
that the constant C, in the previous equation does not depend on œ because 


< E[|o (u(t — £, 0))|" v 1]. 


(E[lo (u(t — e, 02/279] < (E[(lo (u(t — e, 0))| v D 


Now we focus on the third term on the right-hand side of the inequality in (7). Let 
ps denote the density of the zero mean Gaussian random variable SE " Sra A(t — 
5, —y)M (ds, dy), which is independent of the o-field 7;_, and has variance 


sex- f as | M(d£) £ AGE > Ce”. 
0 R4 


In the decomposition (6), the random variable U* (t, 0) is F,—s-measurable. Then, 
by conditioning with respect to ¥;_, and using a change of variables, we obtain 
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e| E[ 105 quc e,0520)lo (u(t — £, O))|" Aro (u^ (t, 05|Z.—«]]l 


m £,0))£0} [ lo (u(t — £, 0)|" A p (UF + o (u(t — e, op») peloddy]| 


IE [lo u(t — £, 0)" A26 Qi t, 0))]| 
| 


m «zo | out- e 0)” 


X $(U; =F o (u(t — €, d 


< Il lloc TT e,0)3:0)lo (u(t — €, or f LA" suceso. 


On the set (o (u(t — £, 0)) Æ O}, the integral in the last term can be bounded 
as follows, 


1 |4” cae 0-a Pe OD dy < Colo (ult — £, 0) IA" PP rim 
< Cnlo (ult — £, | h" g8", 


where we have used the property || Aj f lr:qg < Cnlh|" | fF || r1 ag). and also that 
lp% lj, = (g(e) "7? < C,&-"/? (see e.g. [20, Lemma 2.3]). 
Substituting this into the previous inequality yields 


IE[Io (u(t — £, 0)" A20 Q^ (t, 0)]| < Crliolge Ih" e 7, (12) 
b 


because |Iflloo < lille: 
With (7), (10)-(12), we have 


| f Argoed») 

< Cullølligg (Wile! + PP (os (e) ge)" + rem) 
< Crnlelge (te. 4 eOtv0o/2 4 gbt+yr)a/2 4 | er?) 
< o 


< Calle lleg (Wale? + e? + remm (13) 


Let £ = stlhl?, with p = 2n/(yn + y yo). With this choice, the last term in (13) 
is equal to 


a nó. nyo 
en I (ini yatye) 4 les). 
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Since yı < y, by the definition of y, we obtain 


: 3 
min{yi, y2} T 


y-1= 
y y y 


Fix £ € (0, y — 1). We can choose n € N sufficiently large and o sufficiently close 
to 1, such that 


nó 
a + ————— 
y (n * ya) 


>¢€-+a and -i£-a. 


This finishes the proof of the theorem. 


Remark 1 (i) Assume that o is bounded from above but not necessary bounded 
away from zero. Following the lines of the proof of Theorem 1 we can also 
show the existence of a density without assuming the existence of moments 
of u(t, x) of order higher than 2. This applies in particular to SPDEs whose 
fundamental solutions are general distributions as treated in [8], extending the 
result on absolute continuity given in [20, Theorem 2.1]. 

(ii) Unlike [20, Theorem 2.1], the conclusion on the space to which the density 
belongs is less precise. We do not know whether the order y — 1 is optimal. 


3 Ambit Random Fields 


In this section we prove the absolute continuity of the law of a random variable 
generated by an ambit field at a fixed point (t, x) € [0, T] x R^. The methodology 
we use is very much inspired by [10]. Ambit fields where introduced in [5] with 
the aim of studying turbulence flows, see also the survey papers [6, 15]. They are 
stochastic processes indexed by (t, x) € [0, T] x R of the form 


X(t, x) = w+ ff g(t, s; x, y)o (s, y)L(ds, dy) +f h(t, s; x, y)b(s, y)dyds, 
Aa) B, (x) 

(14) 
where xo € IR, g, h are deterministic functions subject to some integrability and 
regularity conditions, c, b are stochastic processes, A;(x), B(x) € [0,1] x R’ 
are measurable sets, which are called ambit sets. The stochastic process L is a Lévy 
basis on the Borel sets Z([0, T] x R^). More precisely, for any B € Z([0, T] x R?) 
the random variable L (B) has an infinitely divisible distribution; given B,..., Bk 
disjoint sets of B € Z([0, T] x R), the random variables L(B1), ..., L(Bx) are 
independent; and for any sequence of disjoint sets (Aj) jew C A((0, T] x R4), 


oo 
LU Aj) = 5 L(Aj), P-almost surely. 
j=l 


130 M. Sanz-Solé and A. Süß 


Throughout the section, we will consider the natural filtration generated by L, i.e. 
for all t € [0, T], 


F: = o (L(A); A € [0, t] x R, A(A) < oo). 


For deterministic integrands, the stochastic integral in (14) is defined as in [18]. 
In the more general setting of (14), one can use the theory developed in [7]. We refer 
the reader to these references for the specific required hypotheses on g and c. 

The class of Lévy bases considered in this section are described by infinite divisible 
distributions of pure-jump, stable-like type. More explicitly, as in [18, Proposition 
2.4], we assume that for any B € ZZ([0, T] x IR), 


log E[ exp(iéL(B))| = h -—- A(ds, ay) [. 0s, y (dz) ( exp(i&z — 1 — i&zlp 102). 


]x 
where À is termed the control measure on the state space and (o; y) (s y yepo,r]xi4 18 


a family of Lévy measures satisfying 


[ min{1, z^ )ps y (dz) =1,A-as. 
R 1 

Throughout this section, we will consider the following set of assumptions on 
(Os, y) s, yyep0, T] x R4 and on À. 


Assumptions 1 Fix (t,x) € [0, T] x R? and o € (0,2), and for any a > O let 
Og: = (—a, a). Then, 


G 


wa 


for all B € [0, œ) there exists a nonnegative function Cg € L (A) such that for 
alla > 0, 


f IzI? ps, (dz) < Cg(s, ya’, 2 — as; 
(Oa) 
(ii) there exists a non-negative function Cer! (A) such that for all a > 0, 


f Izl ps,y(dz) < C(s, y)a?*, X — a.s.; 
Oa 


(iii) there exists a nonnegative function c € L 1(à) andr > 0 such that for allé € R 
with |5| >r, 


[ (1 — cos(éz)) ps,y(dz) > cs, IRI", à — as. 
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Example 1 Let 
=a —-a-1 
Ps,y(dz) = c1 (s, y)u»gsogz ~ dz-c-iG, YI geglz~ dz, 


with (s, y) € [0, T] x Rf, and assume that c1, c; € L! (A). This corresponds to 
stable distributions (see [18, Lemma 3.7]). One can check that Assumptions 1 are 
satisfied with C = C = c1 Vc_j,andc=c, Ac_}. 


Assumptions 2 (H1) We assume that the deterministic functions g, h:(0 < s < t < 
T) x R? x R? — R and the stochastic processes (o (s, y); (s, y) € [0, T] x R4), 
(b(s, y); (s, y) € [0, T] x R4) are such that the integrals on the right-hand side of (14) 
are well-defined (see the conditions in [18, Theorem 2.7] and [7, Theorem 4.1]). We 
also suppose that for any y € R, p € [2, 00) we have SUD;e(o. 7] Ello (s, y)|?] < oo. 
(H2) Let o be as in Assumptions 1. There exist 01,02 > 0 such that for some 
y € (a, 2] and, if à > 1, forall 6 € [1, œ), or for 8 = l, ifa < 1, 


[lo (t. x) — o (s, YI] < C, t — s + |x — yl”), (15) 
z[Ib(t, x) — b(s, y)|F] < Cp (it — s? + |x — yl”), (16) 


for every (t, x), (s, y) € [0, T] x R’, and some Cy, Cg > 0. 
(H3) |o (t, x)| > 0, w-a.s. 
(H4) Let a, C, Cg and c as in Assumptions 1 and 0 < £ < t. We suppose that 


$ 
J Í, 14,œ (8, VECS, y)lg(t, s, x, y)" A(ds, dy) «oo, — (17) 
t—& 
t 
ce”? 3 i lays, yels, y)lg (t, s, x, y)|*A(ds, dy) < oo, (18) 
t—e JRA 


where in (17), c(s, y) = C (s, y) V Co(s, y), and (18) holds for some yo > 0. 
Moreover, there exist constants C, yj, y2 > 0 and y > a such that 
t ~ 
I ni lays, »)CgG. DIEE, s, x, Y)” lt — € — s|?” (ds, dy) < Ce”, 
t—e JRI 
(19) 
t 
[i E65 att sss Pls — y ds, dy) < Ce”. 
t—e 
(20) 


We also assume that there exist constants C, y3, y4 > 0 such that for all 6 € [1, o), 
ifa > l,orforB = l, ifo <1, 
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t 
| |a 15,09 G. DIAE, s, x, y)|A |t — e — s[?!Payds < CeP5, (21) 
f= 


t 
J D 18,6, DIAE, s, x, y) Plax — yl dyds < Ce”, (22) 
t—& 


where C p is defined as in Lemma 3. 
(H5) The set A; (x) “reaches f", i.e. there is no € > 0 satisfying A;(x) € [0, t — 
e] x Ri. 


Remark 2 (i) By the conditions in (HA), the stochastic integral in (14) with respect 


(i) 


(iii) 


(iv) 


to the Lévy basis is well-defined as a random variable in L^(£2) for any B € 
(0, œ) (see Lemma 3). 
One can easily derive some sufficient conditions for the assumptions in (HA). 
Indeed, suppose that 


t = 
| n l4,í3G. Y) Cg. y)lg(t, s, x, y)" A(ds, dy) < Ce’, 
t—& 


then (19) holds with y; = yı + 61. If in addition, A, (x) consists of points 
(s, y) € [0,t] x R? such that |x — yl < It — s[*, for any s € [t — £, t], and 
for some ¢ > 0, then (20) holds with yo = y; + 62¢. Similarly, one can derive 
sufficient conditions for (21), (22). 

The assumption (H5) is used in the proof of Theorem 2, where the law of 
X (t, x) is compared with that of an approximation X* (t, x), which is infinitely 
divisible. This distribution is well-defined only if A;(x) is non-empty in the 
region [t — £, t] x R°. 

Possibly, for particular examples of ambit sets A; (x), functions g, h, and sto- 
chastic processes c, b, the Assumptions 2 can be relaxed. However, we prefer 
to keep this formulation. 


We can now state the main theorem of this section. 


Theorem 2 We suppose that the Assumptions | and 2 are satisfied and that 


min{y1, 72, Ya. ya] X 1 
yo a 


(23) 


Fix (t,x) € (0, T] x R7. Then the law of the random variable X (t, x) defined by 
(14) is absolutely continuous with respect to the Lebesgue measure. 


3.1 


Two Auxiliary Results 


In this subsection we derive two auxiliary lemmas. They play a similar role as those 
in [10, Sects. 5.1 and 5.2], but our formulation is more general. 
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Lemma 2 Let p = (Ps,y)(s,yjejo,T]xR4 be a family of Lévy measures and let X be a 


control measure. Suppose that Assumption \(ii) holds. Then for all y € (o, 2) and 
all a € (0, oo) 


J |z|” ps,y(dz) < Cy aC(s, ya" *, à — a.s., 
|z|<a 


2—4 
where Cy a = 2Y? z. Hence 


T 
ri f / Iz|” ps, (dz)A(ds, dy) < Ca" *. 
0 JR4 Jjz|<a 


Proof The result is obtained by the following computations: 


oe) 
J lel” Ps, s (dz) = 2) |z|” os, y (dz) 
|z|<a AO (a2^—1 <|z|<a2-"} 


< $ (ag eyes J 
n=0 


|z|<a2—" 


Izl? ps, y (dz) 
} 


oo 

< Cs y) > (a2 n hr ?(a2 ny a 
n=0 

< Cy-aC(s, ya”. 


The next lemma provides important bounds on the moments of the stochastic 
integrals. It plays the role of [10, Lemma 5.2] in the setting of this article. 


Lemma 3 Assume that L is a Lévy basis with characteristic exponent satisfying 
Assumptions | for some æ € (0, 2). Let H = (H(t, X))(t,x)€[0,T]xR4 be a predictable 
process. Then for all0 < B «à < y <2andforall0<s<t<s+l, 


B 
s / [ laa) yar, x, y) HG, y)L(ar, ay) | 


< Cag. y |t — s|// 71 

^ B/y 
x di n lao. y) Cg, y)lgtt. r, x, y)" E[ B (u, y)” JA (ar, d») l 
S 


(24) 


where Cg (r, y) is the maximum of C(r, y), and (Cg + Ci)(r, y) (see Assumptions | 
for the definitions). 
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Proof There exists a Poisson random measure N such that for all A € BR‘), 


t t 
L([s, t] x A) =f f J zÑ (dr, dy, dz) «f | J zN (dr, dy, dz) 
s JAJ|z|<1 s JA J|z|»1 


(see e.g. [14, Theorem 4.6]), where N stands for the compensated Poisson random 
measure N (ds, dy, dz) = N (ds, dy, dz) — ps,y(dz)A(ds, dy). Then we can write 


1 


with 


n p 
J Í, law y)g(t, r, x, y)H (r, y)L(dr, ay) | < Cp(Il, 12, +R), 
s 
(25) 


] 

2 ' 3 

R= | J | racers g(t, r, x, y) HG, y)Ñ (dr, dy, dz) 
s JRI t-s)" «|z| 1 

] 

To give an upper bound for the first term, we apply first Burkholder's inequality, then 


the subadditivity of the function x  x?/? (since the integral is actually a sum), 
Jensen's inequality, the isometry of Poisson random measures and Lemma 2. We 


obtain, 
t 
T loy. y) 
s JRI [z| 5)! 


x izlett, r, x, y)? LH (r, y) N (dr, dy, dz) 


t 
Alf f la oy. y) 
s JRI Iz| e r—5)U/e 


x |zl"|gqt. r, x. y)” IH (r, y)I" N (dr, dy, dz) 


t 
«os f f lae, y) 
s JRI Iz| 5) 


B/y 
x |z” lalt, r, x, y)" |H (r, y)" N (dr, av.ad |) 


t 
-o f f Lust »( f koyda) 
s JRI Iz eas) 


t 
li T law y) zg(t, r, x, y) H(r, y) N (dr, dy, dz) 
s JRI Iz|za—s)U/e 


] 


t 
= | n lao y) zg(t, r, x, y) H(r, y) N (dr, dy, dz) 
s JRI |z|»1 


1}, < Cg | 


i 
7] 


<C 


DR 
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Bly 
x |g(t. r. x, y)" | HG, y) "Ar, 4») 


« Cg [Cali _ goo 


é Bly 
” (/ [seo ne nts rs x DEIA, tr 2 


Notice that the exponent (y — o)/o is positive. 
With similar arguments but applying now Assumption 1(1), the second term in 


(25) is bounded by 
f 
2 
n lA, co Gs y) |z| 
s JRI (t—s)!/% <|z|<1 


g 


t 
< Cg {ff laco. y) Iz]? 
s JRI (t—s)l/8 «|z|«1 


x |gtt, r, x, y)lP IH (r, y)l? N (dr, 2.45) 


t 
| / f Lust »( | lfp) 
s JRI (t—s)!/" «|z| «1 


x |gtt, r, x, y)P |H (r, y)lP (ar, ZI 


D z Cg | 


x |2, r, x, y)? |H (r, y)? N (dr, dy, dz) 


=C 


D 


t 
< Cg(t — Poe | / n 14,œ) (r, y)Ca(r, y) 
s JRI 
x |g(t. r, x, YIP [IH r, y)1P (ar, ay)| 
t 
< Cg, (t — aean] n lA cy Gs y)Cgtr, y) 
s JIR4 
Bly 
* let. rx, PEL ( PI pr dy) ; 


where in the last step we have used Hölder’s inequality with respect to the finite 
measure Cg (r, y)A(dr, dy). 

Finally, we bound the third term in (25). Suppose first that 8. < 1. Using the 
subadditivity of x +> x^ and Lemma 2(i) yields 


t 
iB s ege] [sen f. after x ors oer ay, ad 
Ss |z|>1 


t 
< CgE n Lace »( f zl? or,y (dz) 
s JR4 |z|»1 
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x |g(t. r, x, y)PI HG, y)lP (ar, a» 


t 
s Cg | J L , Ico frs Cpr IIe r, x, y)" LH C, y) Ar, ZI 


B/y 


t 
< cf L laa y)Cg(r, ywlgt.r, x, yr z [1H (r, y)” Jar, 2 , 


where in the last step we have used Hólder's inequality with respect to the finite 
measure Cg(r, y)A(dr, dy). 
Suppose now that 6 > 1 (which implies thata > 1). We apply Hólder's inequality 
with respect to the finite measure C, (r, y)A(dr, dy) and Assumption 1(1) 
] 


t 
I. < 2P—'m / f lac) Q0. y) zg(t, r, x, y) H (r, y) N (dr, dy, dz) 
; s JRI |z|»1 


t 
n. sf f Lust » (f iip. dz) 
s JRI |z|»1 


B 
x g(t,r, x, y) H(r, y)A(dr, ay) | 


t 
< Cp t I lac (rs y) Iz? lett. r, x, y)? 
s JRI |z|»1 


"| 


t B 
/ a Vossio 63. WING 30 Dus ay) | 


x |H(r, y) N (dr, dy, dz) 


+ ey 


t 
« Cg i] PA Petr x PIG IPN Car dy, dz) | 
s JRI |z|»1 


t B—1 
«e f | Cv 94r. dy)) 
s Rd 


t 
x J f | Leo Ce VIl rx, PELL GS IP dr, dy) 


t 
< Cp | | L 1a, yC C, y) + Ca, y) 


x |g(t, r, x, FIA (u, y)|fà(dr, d»)| 


t 
= ef I, laa y)(Ci(Gr, y) + Cer, y)) 


B/y 
x |g, r, x. y)" [rc o? lar dy)) ; 


Non-elliptic SPDEs and Ambit Fields: Existence of Densities 137 
where in the last step we have used Hólder's inequality with respect to the finite 


measure (C1(r, y) + Cg(r, y))A(dr, dy). We are assuming 0 < t — s < 1, and 
0 < B <a. Hence, the estimates on the terms Z} ,, i = 1, 2, 3 imply (24) 


s,t? 
3.2 Existence of Density 


With the help of the two lemmas in the previous subsection, we can now give the 
proof of Theorem 2. Fix (t, x) € (0, T] x IR? and let 0 < £ < t to be determined 
later. We define an approximation of the ambit field X (t, x) by 


t 
X*(t, x) = uet otou- f f Lax) (s, y) g(t, 5; x, y)L(ds, dy), (26) 
t—& JRI 
where 
t-—e 
Ui abeat n f | Vo Dti y) ss LCs, dy) 
0 
t—& 
" f n boni nace s aude 
0 Rd 
t-—é 
+ ba —e.x) | [a 1p,co S, yA, s; x, y)dyds 
0 R 


Note that U* (t, x) is F;---measurable. 

The stochastic integral in (26) is well defined in the sense of [18] and is a random 
variable having an infinitely divisible distribution. Moreover, the real part of its 
characteristic exponent is given by 


R(log E| expGéX)]) = 1. (1 — cos(€z)) o¢ (dz), 


where 
pf(B) 2 ] revemo s ds, dy) 
[0, T] x R4 JR 


In the setting of this section, the next lemma plays a similar role as [20, Lemma 
2.3]. It generalizes [10, Lemma 3.3] to the case of Lévy bases as integrators. 


Lemma 4 The Assumptions 1, along with (17) and (18) hold. Then, the random 
variable 


t 
xX: = i i Lac(s, 3985 s, x, y)L(ds, dy) 
t—e JRI 


has a €% -density Ppr .«, and for all n € N there exists a finite constant Cj, > 0 such 
that | pf glia S Cria” ^ D. 
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Proof We follow the proof of [10, Lemma 3.3], which builds on the methods of [21]. 
First we show that for |&| sufficiently large, and every t € (0, T], 


Crx elél” x RWx(E) x CIEI. Q7) 


Indeed, let r be as in Assumption 1(iii). Then, for |5| > r, we have 


Wy (E) = L (1 — cos(£z)) pp (dz) 
t 
= f n A(ds, dy) n (1 — eos (Ez 1a, co (S. gE s, x. 9) 0s, (d2) 
t—e JRI R 


t 
> e | f | Vaso Gs DIEE, s, x, eG, A (ds, dy) 
t—g 


2 rexe nEn 


(28) 


This proves the lower bound in (27) for |&| >r. 
In order to prove the upper bound in (27), we set 


UE ts x, y: = Ella ay s, y)la(t, S, X, y)| 


and use the inequality (1 — cos(x)) < 2(x? ^ 1) to obtain 
t 
wwe) = f f aads dy) | (- costet tacols, ats ns ID) oss (d) 
1—& 
t 
<2 f f masay f (EPPA os gE sx DP ^ Nes d 
t—e J R° 


t 
=al ] is.» IPIE P La co Gs Dl ts s, x, Y) ps, y (dz) 
t—& JRI Izlag, x.y 


t 
+2f l à (ds, dy) Ds, y (dz). 
t—e JRI EE 


Then, using Assumption 1(ii), the first integral in the right-hand side of the last 
equality in (29) can be bounded as follows: 


t 
f [das DEEP. s (f Iz ^p, (dz) 
t—e JRI BEN 


t 
< er f ds l4,65G. DIZE, s, x, y)|* Cs, y)A (ds, dy) 
t—& 


< CIĘ”, 


(29) 


where in the last inequality, we have used (17). 
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Consider now the last integral in (29). By applying Assumption 1(1) with 8 = 0 
and (17) 


t 
f nasan ( / 2 
t—e JIR4 Izlxag is x,y 


t 
= er f i Lays, y)Cols, y)1g(t, s, x, y) (ds, dy) 
t-e 


< CJE”. 
Hence, we have established that 
9twx (E) < CIEI, 


for |E | sufficiently large. 

To complete the proof, we can follow the same arguments as in [10, Lemma 3.3] 
which rely on the result in [21, Proposition 2.3]. Note that the exponent yo on the 
right-hand side of the gradient estimate accounts for the lower bound of the growth 
of the term in (18), which in the case of SDEs is equal to 1. 


The next lemma shows that the error in the approximation X*(f, x) in (26) and 
the ambit field X (t, x) is bounded by a power of e. 


Lemma 5 Assume that Assumptions 1 hold for some a € (0, 2) and that c, b are 
Lipschitz continuous functions. Then, for any B € (0, œ), and e € (0,t ^ 1), 


[IX (t,x) — X*( x^] < cue 9) 


where y: = min(yi, y2, ya. ya]- 
Proof Clearly, 


[lX (t,x) — X* (t, OF] 


«esl 


t p 
| n Vott E IP) oot as ay) | 
t—& 


] 
Fix y € (o, 2] and apply Lemma 3 to the stochastic process H (s, y): = o(s, y) — 
o (t — £, x), where the arguments f, £, x are fixed. We obtain 


t 
+ Cg | J |, 15,0), y)h(t, 5; x, y)(b(s, y) — b(t — e, x) dyds 
t-eJ/JR 
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i B 
|| | [a Luo Dali sia No(s, 9) = al — 8, Ldss dy) | 
t= 


t 
< Cago f is 14,69 G. Y)CgG, Yla w lgt, s, x, y)" 
t—& 


Bly 
x E[lo(s, y) — o (t — e, x)|” Jas, 2 


Owing to hypothesis (H2) this last expression is bounded (up to the constant 
Ca piye et) by 


t 
(/ | Laws, y)Ca(s, Yla w lelt, s, x, y)" 
t—e JRI 
Bly 
x (It-e— s|?” + |x — yl?”)acds, d») 
The inequality (19) implies 
i i 2 g Bly 
ehe (/ [5 Lo Ep. y)lgt,s, x, y) It — € — sl Ads dy) 
1—&€ 
gulis 1, 


and (20) yields 


: . B/y 
staf f n laos, Y)Cg(s, y)lg(t, s. x, y)” |x — yis. dy)) 
t—E 


1 
< CePG tY»2-7l. 


Thus, 


t p 
f [ | Muros Dat, 5; x, OCS, y) - ott. Lids. dy) | 


1 


Assume that 6 > 1 (and therefore a > 1). Hólder's inequality with respect to the 
finite measure h(t, s; x, y)dyds, (H2), (21), (22), imply 
] 


l 


t 
<c f f Meo DIAE, sx PEIG, y) = b0 = 5 P dyads 
t—& 


< Ceblatnan)-1, (30) 


t 
Í [ | Aseo s BG, s; x, Ols, y) — btt — £, x))dyds 
t—& 
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' ô 
Co f [meo DhE sx Pl e — sP dyas 
t-e JR 


t 
& C5 f f neo Wis sx) — y dyds 
t—é 


< Ceb Orv), 


Suppose now that 6 < 1, we use Jensen’s inequality and once more, (H2), (21), 


(22), to obtain 
l B 
t a Vp œ (8; DAC, s; x, y) (b, y) — BU — £, x) )dyds | 
B 
(= Lf [rats otis six 166, y) = ba — e ots |) 
B 
-( ha 18,08, DIAG, s; x, YELI, y) — bG — e, 22 


B 
«ef de Lp,c(s, IAC, 5; x, [le = sti + Ix — y] dyas) 


< Ce BOnS^Yya). 


IA 


This finishes the proof. 
We are now in a position to prove Theorem 2. 
Proof (Proof of Theorem 2) We consider the inequality 
Ello (t, x)!" A26 (X (t, x) < [E[(lo t. oI" — lo (t — e, 30") A2 X (t, x))] | 


+ [E[le t — e, x)|” (A50 (X t, x)) — A59 (X° tt, x)))]| 
+ [E[lo (t — £, x)|” ARo (X* t, x) |]. (31) 


Fix 7 € (0, ^ 1). As in (8) we have 


z[(Io (t. x)|" — lo (t — e, x)|”) Az (X (t. x))]| 
< CallollgrlAl"E[|lo(t, x)" — Io — x)!" |]| 


Now we proceed as in (9) using the finiteness of the moments of o (t, x) stated in 
Hypothesis (H1), and (H2). Then for all y € (o, 2] we have 


a[|lo(t, x)|" — le (t — e, x)" |] 


n—i 
= elec. x)— o(t — &, x)| > lo (t, x)| |o (t — e, orti 


j-0 
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jo (t, x) —o(t — e») 


n—l AN Y/G-D3N 1-l/y 
x (=| (Stee. rto - e oorr) )) 
j=0 


< Cre". 


IA 
OQ 
— 


Therefore 


JE[(lo t, 1" — lot — e, D) Ape X )]| < CallollgnlAlte". — (32) 


Consider the inequality I An óleo < Ch Idee. and apply Hólder's inequality 
with some f € (n, œ) to obtain 


a[lo(t — e, x)|" CA (X t, x) — Ag (X° Qs 0))]| 
< CallollgrE[lo t — e, x) "1X Œ, x) — X° t, x)I"] 


< Calile (ELX €, x) — X°, OPN (E[Io Qut — e, 0) 67] 7v? 


zd 


"e 
« Calles e) 5 (33) 


where y: = min{y1, y2, 73, 74}, and we have applied Lemma 5. 
Conditionally to ¥;—,¢, the random variable 


t 
J f law G. y)g(t, s; x, y)L(ds, dy) 
t—e JRI 


has an infinitely divisible law and a @°-density pr, xe for which a gradient estimate 
holds (see Lemma 4). Then, by a discrete integration by parts, and owing to (H3), 


|E[lo «t — e, x)|” Az 0 (X* (t, x) ]| 


= n lo (t — e, x)" A p (UF 4- o (t — e, I 


= "i lo (t — e, x)" (UF -F a (t — e, 99A" Las Prss 0p] 


< IIblloo [iow -e L P 
From Lemma 4 it follows that 


L |A" sae- Praxe Ody < Calo (t — e, DI IAP IPE lc 


< Clo (t — e, x)|” h P e7”, 
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which yields 


[Ello (t — 5:0" Ape XE E, x]| < Colléllgnlhl"e7 ^, — G4) 


because ||ólloo < lole. 
The estimates (31)-(34) imply 


l B 
[lo (t, x)" ARO QC E, 2)]I < Cup lldllgn (In? 4 e )75 4 eom 


Sete = 5|h|^, with |A| < 1 and 


P ( aß 2) 
Btapy—a’ nyo 


a(n=n) _ 


Notice that, since limy— oo PU a for P close to o and yo as in the hypothesis, 
this interval is nonempty. Then, easy computations show that with the choices of € 


and p, one has 


fitus m 
inne + e G09)73 4 ureomíe < gt 


with ¢ > 5. Hence, with Lemma | we finish the proof of the theorem. 


Remark 3 (i) If o is bounded away from zero, then one does not need to assume 
the existence of moments of sufficiently high order. In this case one can follow 
the strategy in [20]. 
(ii) The methodology used in this section is not restricted to pure-jump stable-like 
noises. One can also adapt it to the case of Gaussian space-time white noises. 
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Part II 
Applications 


Dynamic Risk Measures and Path-Dependent 
Second Order PDEs 


Jocelyne Bion-Nadal 


Abstract We propose new notions of regular solutions and viscosity solutions for 
path-dependent second order partial differential equations. Making use of the martin- 
gale problem approach to path-dependent diffusion processes, we explicitly construct 
families of time-consistent dynamic risk measures on the set of càdlàg paths /R" val- 
ued endowed with the Skorokhod topology. These risk measures are shown to have 
regularity properties. We prove then that these time-consistent dynamic risk mea- 
sures provide viscosity supersolutions and viscosity subsolutions for path-dependent 
semi-linear second order partial differential equations. 


Keywords Path-dependent PDE - Risk measures * Martingale problems 


MSC: 35D40- 35R15 - 35K55 - 60J60 - 91B30 


1 Introduction 


Diffusion processes are linked with parabolic second order Partial Differential 
Equations via the *Feynman-Kac" formula. The field of path-dependent PDEs first 
started in 2010 when Peng asked in [19] whether a BSDE (Backward Stochastic 
Differential Equation first introduced in [17]) could be considered as a solution to a 
path-dependent PDE. In line with the recent literature on the topic, a solution to a 
path-dependent second order PDE 


Hu, w, ou, w), duh (u, w), Dy, (u, w), D?¢ġ(u, w)) = 0 (1) 


is searched as a progressive function $ (u, œ) (i.e. a path dependent function depend- 
ing at time u on all the path up to time u). 
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In contrast with the classical setting, the notion of regular solution for a path- 
dependent PDE (1) needs to deal with càdlàg paths. Indeed to give a meaning to 
the partial derivatives D. (u, œw) and Deo (u, œw) at (uo, wo), one needs to assume 
that $ (uo, w) is defined for paths w admitting a jump at time uo. Peng has intro- 
duced in [20] a notions of regular and viscosity solution for path-dependent second 
order PDEs. In [20] a regular or a viscosity solution for a path-dependent PDE is a 
progressive function ó (t, c») defined on the space of càdlàg paths endowed with the 
uniform norm topology and the notion of continuity and partial derivatives are those 
introduced by Dupire [12]. A comparison theorem is proved in this setting [20]. 
The motivation comes mainly from the theory of BSDE and examples of regular 
solutions to path-dependent PDEs can be constructed from BSDEs [18]. The main 
drawback for this approach based on [12] is that the uniform norm topology on the 
set of càdlàg paths is not separable, hence it is not a Polish space. Recently Ekren et 
al. proposed a notion of viscosity solution for path-dependent PDEs in the setting of 
continuous paths in [13, 14]. This work was motivated by the fact that a continuous 
function defined on the set of continuous paths does not have a unique extension 
into a continuous function on the set of càdlàg paths. Therefore it is suitable that 
the notion of viscosity solution for functions defined only on the set of continuous 
paths does not require to extend the function to the set of càdlàg paths. The approach 
developed in [13, 14] is also based on BSDE. 

In the present paper we introduce a new notion of regular and viscosity solution 
for path-dependent second order PDEs (Sect. 2). A solution to (1) is a progressive 
function ¢ defined on JR, x 2 where €2 is the set of càdlàg paths. In contrast with [20] 
and many works on path-dependent problems, we consider the Skorokhod topology 
on the set of càdlàg paths. Thus £2 is a Polish space. This property is very important. 
To define the continuity and regularity properties for a progressive function, we make 
use of the one to one correspondence between progressive functions on IR, x «2 
and strictly progressive functions on IR} x £2 x IR" established in [3]. A function 
$ defined on IR} x €2 is progressive if $ (s, w) = $ (s, w’) as soon as w(u) = o'(u) 
for all 0 < u < s. A function ¢ defined on IR} x (2 x IR" is strictly progressive 
if $(s, w, x) = d (s, œ, x) as soon as w(u) = o'(u) for all 0 < u < s. The one to 
one correspondence is given by Q (s, œ, x) = d(s, w xs x) where (c; x)(u) = w(u) 
for all 0 < u < s and (o xs x)(u) = x for all u > s. The continuity and regularity 
properties that we want for a progressive function $ are derived from the usual 
continuity and regularity properties for $ via the above one to one correspondence. 
For example, D$ (u, œ) is defined as D«$ (u, œ) := D,b(u, œw, @(u)) where D, is 
the usual partial derivative of $ with respect to the third variable. Notice that via the 
above one to one correspondence, the regularity properties for a progressive function 
$ are defined in a very natural way. This is in contrast with the most commonly used 
regularity definitions first introduced in [12]. 

The notion of viscosity solution that we introduce in the present paper is motivated 
by our construction of a solution to semi-linear second order path-dependent PDEs 
based on the martingale problem approach. 

Our study for viscosity solutions of path-dependent PDEs allows then to introduce a 
new definition of viscosity solution for path-dependent functions defined only on the 


Dynamic Risk Measures and Path-Dependent ... 149 


set of continuous paths. As in [13, 14], this does not require to extend the function 
nor the coefficient functions appearing in the path-dependent PDE to the set of càdlàg 
paths. However our approach is very different from the one introduced in [13, 14]. 
In the present paper we construct then time consistent dynamic risk measures on the 
set 2 of càdlàg paths, to produce solutions for path-dependent semi-linear second 
order PDEs. 


QuV(u, w) + .Z*v(u, w) + f (t, w, Dxv(u, )) = 0 on [0, t] x 2 2 
v(t, o) = h(o) (2) 


with “4 v(u, œw) = 5Trlatu, w)D2v(u, w)]. 

These dynamic risk measures are constructed using probability measures solution 
to a path-dependent martingale problem. This approach is motivated by the Feyn- 
man Kac formula and more specifically by the link between solutions of a parabolic 
second order PDE and probability measures solutions to a martingale problem. The 
martingale problem has been first introduced and studied by Stroock and Varadhan 
[10, 11] in the case of continuous diffusion processes. The martingale problem is 
linked to stochastic differential equations. However the martingale problem formu- 
lation is intrinsic and is very well suitable to construct risk measures. In [22] the 
martingale problem has been extended and studied to the case of jump diffusions. In 
[3], the study of the martingale problem is extended to the path-dependent case which 
means that the functions a and b (and also the jump measure) are no more defined 
on JR+ x IR" but on IR, x 2. The question of existence and uniqueness of a solution 
to a path-dependent martingale problem is addressed in [3] in a general setting of 
diffusions with a path-dependent jump term. In the case where there is no jump term 
and under Lipschitz conditions on the coefficients, the existence and uniqueness of a 
solution has been already established in [8] from the stochastic differential equation 
point of view. 

In Sect. 3, we recall some results from [3] on the martingale problem for path- 
dependent diffusion processes and study the support of a probability measure solution 
to the path-dependent martingale problem for 7%”. 

The theory of dynamic risk measures on a filtered probability space has been 
developped in recent years. In the case of a Brownian filtration, dynamic risk mea- 
sures coincide with g-expectations introduced by Peng [21]. An important property 
for dynamic risk measures is time consistency. The time consistency property for 
dynamic risk measures is the analogue of the Dynamic Programming Principle. For 
sublinear dynamic risk measures time consistency has been characterized by Del- 
baen [9]. For general convex dynamic risk measures two different characterizations 
of time consistency have been given. One by Cheridito et al. [7], the other by Bion- 
Nadal [5]. This last characterization of time consistency is very useful in order to 
construct time consistent dynamic risk measures. 

Following [5], one can construct a time consistent dynamic risk measure as soon 
as one has a stable set of equivalent probability measures 2 and a penalty defined 
on 2 satisfying some conditions. In Sect. 4, we construct a stable set of probability 
measures on the set £2 of càdlàg paths. In the whole paper a is a given bounded 
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progressively continuous function defined on JR, x £2 such that a(s, œw) is invertible 
for all (s, c»). Forall r > Oand w € 2 the set of probability measures 2, w is a stable 
set generated by probability measures owe solution to the martingale problem for 
"^^ starting from c at time r. The functions b are assumed to satisfy some uniform 
BMO condition. In Sect. 5 we construct penalties on the stable set 2, a from a path- 
dependent function g. Some growth conditions are assumed on the function g to 
ensure integrability properties for the penalties. With such a stable set and penalties, 
we construct in Sect. 6 time consistent convex dynamic risk measures. More precisely 
for all r and œw we construct a time consistent convex dynamic risk measure Psr on 
the filtered probability space (2, (2), QF o) where Q7 „ means Q9 and (4j) is 
the canonical filtration. 

We prove furthermore in Sect. 7 that these time consistent dynamic risk measures 
satisfy the following Feller property: Let @ be the set of A, measurable functions 
h defined on §2 which can be written as h(w) = k(w, w(t)) for some continuous 
function k on 2 x IR" such that k(c, x) = k(o’, x) if w(u) = o (u) for all u < t. 
Then for all A in G, there is a progressively lower semicontinuous function R(A) on 
[0, t] x 2 such that R(A) (t, œw) = h(o), 


ori" (h) = R(h)(r, eo) V0 <r <t 
Furthermore, for all 0 € r € s « t, 
pri? ho") = ROG. o^) Qf oy a.s. 


We prove furthermore in Sect. 8 that the lower semicontinuous function R(h) is 
a viscosity supersolution for the path-dependent semi linear second order partial 
differential equation (2). The function f : IR+ x IR" x IR" — IR appearing in Eq. (2) 
is linked to the choice of the penalty of the risk measure. It is convex in the last 
variable. 

We prove also that the upper semi-continuous envelope of R(h) is a viscosity subso- 
]ution for (2). 

When the above function A is defined only on the set of continuous paths, it is the same 
for the function R(h). We prove then that R(h) provides a viscosity supersolution and 
a viscosity subsolution for (2) on the set of continuous paths. 


2 Solution of Path-dependent PDEs 


Inthis section we introduce new notions for regular and viscosity solutions for second 
order path-dependent PDEs on the set of càdlàg paths. In contrast with [20] and all 
the papers using the notions of continuity and derivative introduced by Dupire [12], 
we work with the Skorohod topology on the set of càdlàg paths. A solution to a 
path-dependent PDE (1) is a progressive function $ (t, œw) where t belongs to JR and 
w belongs to the set of càdlàg paths. 
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2.1 Topology and Regularity Properties 


In the whole paper 2 denotes the set of càdlàg paths with the Skorohod topology. 
The set 42 is then a Polish space (i.e. is metrizable and separable). Polish spaces have 
nice properties which are very important in the construction of solutions for path- 
dependent PDEs. Among them are the existence of regular conditional probability 
distributions, the equivalence between relative compactness and tightness for a set 
of probability measures, to name a few. 

To define the continuity and regularity properties for progressive functions, we use 
the one to one correspondence between progressive functions on JR; x 92 and strictly 
progressive functions on IR} x 42 x IR" that we have established in [3]. 

A function $ defined on IR} x €2 is progressive if $ (s, œ) = (s, œ") as soon as 
w(u) = o'(u) for all 0 < u < s. 

A function $ defined on IR} x Q x IR" is strictly progressive if $(s, œ, x) = 
Q (s, œ, x) as soon as w(u) = e (u) for all 0 < u < s. 


The one to one correspondence $ — ¢ is given by $ (s, w, x) = $ (s, c, x) where 
w xs x(u) -o(uvVOxuscs andosx,x(u)—xVuzrs. (3) 
Notice that ¢ (s, œ) = ó(s, w, w(s)). Accordingly a progressive function $ (in 2 


variables (s, «)) is said to be progressively continuous if the associated function ¢ 
(in 3 variables (s, c, x)) is continuous on IR} x Q x IR". 


2.2 Regular Solution 


Making use of the one to one corrrespondence between progressive functions on 
IRs x Q and strictly progressive functions on IR} x £2 x IR", we can then give the 
following definition for a solution to a general path-dependent PDE. 


Definition 1 Let v be a progressive function on IR} x S2 where X2 is the set of 
càdlàg paths with the Skorokhod topology. v is a regular solution to the following 
path-dependent second order PDE 

Hu, w, v(u, œ), ðuv(u, w), Dyv(u, w), Dyv(u, @)) = 0 (4) 


if the function v belongs to 7)? (IR, x 2 x IR") and if the usual partial derivatives 
of v satisfy the equation 


Au, w *y x, V(U, o, x), Quv(u, o, x), Dyv(u, o, x), D?y(u, @,x)=0 (5) 


with v(u, w, x) = v(u, c *y x) 
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(@ xu x)(s) = w(s) Vs < u, and (w *, x)(s) = x Vs > u. The partial derivatives of v 
are the usual ones, the continuity notion for v is the usual one. 


2.3 Viscosity Solutions on the Set of Cadlag Paths 


The following definitions are motivated by the construction of viscosity solutions for 
path-dependent PDEs that we develop in the following sections. Our construction of 
solutions is based on the martingale problem approach for path-dependent diffusions. 
The support of every probability measure DA solution to the martingale problem 
for -Z%? starting from «o at time r is contained in the set of paths which coincide with 
wo up to time r. This is a motivation for the following weak notion of continuity and 
also for the weak notion of local minimizer (or local maximizer) that we introduce 
in the definition of viscosity solution. 


Definition 2 A progressively measurable function v defined on IR, x (2 is contin- 
uous in viscosity sense at (r, wo) if 


v(r, wo) = lim {v(s, w), (S,@) € De(r, wo)} (6) 


where 


D,(r, wo) = {(s, œ), rSs<r+e, o(u) = vou), VO Sur 


@(u) = w(s) Vu > s, and sup ||o(u) — a@o(r)|| <e} (7) 


v is lower (resp. upper) semi continuous in viscosity sense if Eq.(6) is satisfied 
replacing lim by lim inf (resp. lim sup). 


Definition 3 Let v be a progressively measurable function on UR» x Q, (2)) 
where €2 is the set of càdlàg paths with the Skorokhod topology and (47;) the canon- 
ical filtration. 


1. vis a viscosity supersolution of (4) if v is lower semi-continuous in viscosity 
sense, and if for all (to, wo) € IR. x Q, 


e vis bounded from below on D; (to, wo) for some € > 0. 

e for all strictly progressive function @ € A 02 OR, x 92 x IR") such that 
V(to, w0) = ¢ (to, Wo), and (to, wo) is a minimizer of v — $ on Dg (to, wo) for 
some € > 0, 


A(u, œ xu x, ġ (u, v, x), Qu (u, œw, x), Do (u, œ, x), D? (u, @,x)>0 (8) 


at point (fo, wo, c (to)). 


Dynamic Risk Measures and Path-Dependent ... 153 


2. visa viscosity subsolution of (4) if v is upper semi-continuous in viscosity sense, 
and for all (to, wo), 


e vis bounded from above on D; (to, wo) for some £ > 0 

e for all strictly progressive function @ € LA 02 OR, x Q x IR") such that 
V(to, w0) = $ (to, wo), and (to, wo) is a maximizer of v — $ on De (to, wo) for 
some € > 0, 


H (u, œ x, X, Q(u, v, x), Q4 (u, œw, x), Do (u, œ, x), D?(u, @,x) x 0 (9) 


at point (fo, co, co (to)). 


3. v is a viscosity solution if v is both a viscosity supersolution and a viscosity 
subsolution. 


2.4 Viscosity Solution on the Set of Continuous Paths 


Recently Ekren et al. [13, 14] introduced a notion of viscosity solution of a path- 
dependent second order PDE for a function v defined on the set of continuous paths. 
One motivation for this was to define the notion of viscosity solution without extend- 
ing the function v to the set of càdlàg paths. 

We can notice that within our setting we can also define a notion of viscosity solution 
for a function v defined only on the set of continuous paths without extending v. We 
give the following definition which is very different from that of [13, 14] and much 
simpler. 


Definition 4 Let v be a progressively measurable function defined on IR; x 

@ UR, IR") with the usual uniform norm topology. v is a viscosity supersolu- 

tion of (4) if v is lower semi-continuous in viscosity sense and for all (fo, wọ) € 

IRs x CUR, IR") 

e vis bounded from below on D, (to, wo) for some e€ > 0, 

e forall function strictly progressive $ € g! Do CR} x 2 x IR") such that v(fo, wo) = 
(to, wo), and (to, œo) is a minimizer of v — ¢ on D, (to, wo) for some e > 0, 


H (u, w *y x, $i, w, X), 8,9 (u, w, X), D.ó(u, æ, X), D?ó(u, w,x)>0 (10) 


at point (fo, wo, wo(to)). . 
Here 2 is the set of càdlàg paths with the Skorokhod topology, D; is the intersection 
of D, with the set of continuous paths, and $ (u, w) = $(u, w, w(u)). 


The lower semi-continuity property in viscosity sense at (fo, œo) in Definition 4 
means 
V(fo, co) = lim (v(s, w), (s, œ) € De(to, 0)] 
E 
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We have a similar definition for a viscosity subsolution. Notice that the continuity 
along the sets D, is also considered in [13, 14]. However the notion of viscosity 
solution introduced in [13, 14] is fundamentally different from ours. 

We will now construct time consistent dynamic risk measures making use of proba- 
bility measures solution to a path-dependent martingale problem. We will then prove 
that this leads to viscosity solutions to path-dependent PDEs (2). 


3 Path-dependent Martingale Problem 


In the classical setting, the Feynman Kac formula establishes a link between a solu- 
tion of a parabolic second order PDE and probability measures solutions to a mar- 
tingale problem. Assume that v is a solution of the PDE „v(t, x) + Le v(t, x) — 0, 
v(T, .) = h with 


Le vit,x) = jT. x))D2 (v)(t, x) + b(t, x)* Dv(t, x) 


From the Feynman Kac formula, the value v(t, x) can be expressed from the prob- 
ability measure of solution to the martingale problem associated to the operator 
Lub starting from x at time f. v(t, x) = E geb (h(Xr)), where (X,) is the canonical 
process. d 

One natural way to construct soliutions for path-dependent parabolic second order 
partial differential equations is thus to start with probability measures solution to 
the path-dependent martingale problem associated to the operator .Z/^^ for path- 
dependent coefficients a and b. Let 2 be the set of càdlàg paths and (4%;) be the 
canonical filtration. Let a and b be progressively measurable functions on IR, x «2 
(a takes values in non negative invertible matrices and b in JR"). Let Z ab be the 
operator defined on E UR") by 


Le 8? ii a 
Lb, w) = = > a(t, e) —— + > dilt, ©) — 11 
(60) — 7 aij (t orans i irs (11) 


Definition 5 Let r > 0, wo € Q. A probability measure Q defined on (2, (2) 
is a solution to the path-dependent martingale problem for .Z/^^ starting from wọ at 
time r if 

Qo € 2 lo(u) = wou) VO Sus rp -]1 


and if for all f € ^ (R, x IR"), and all t, (Z4/?),-<1 given by 


‘a 
Zi? = f0.X 9) - ft X) f s +L? (u, o) (f), Xu(@))du (12) 


is a (Q, (2,)) martingale. 
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In [3] we have studied the more general martingale problem associated with path- 
dependent diffusions with jumps. We have shown that the good setting to prove 
that the martingale problem is well posed is to deal with diffusions operators whose 
coefficients a and b are progressively continuous. 

Recall the following result from [3]. 


Theorem 1 /. Let a be a progressively continuous bounded function defined on 
IR} x 2 with values in the set of non negative matrices. Assume that a(s, œw) is 
invertible for all (s, œw). Let b be a progressively measurable bounded function 
defined on IR} x Q with values in IR". For all (r, wo), the martingale problem 
for Le starting from wọ at time r is well posed i.e. admits a unique solution 
Du on the set of càdlàg paths. 

2. Assume furthermore that b is progressively continuous bounded. Consider the 
set of probability measures M (82) equipped with the weak topology. Then the 
map 

(ro, x) € R} x Q x IR" > OF” € //(Q) 


1, WX 


is continuous on ((r, œ, x) | o = ox, x}. 


3.1 The Role of Continuous Paths 


a,0 
T,OQ 


In all the following Q7 ,, means Q. 
We start with a result which proves that the probability measure Q^^^ is supported 


F,OQ 
by paths which are continuous after time r. 


oye "qu „ab . ; 
Proposition 1 Every probability measure Us solution to the martingale problem 
for Le starting from wo at time r is supported by paths which are continuous after 
time r, i.e. continuous on |r, cof. 
More precisely 

Qr oe ([e, wu) = wolu) Vu < r, and ox. € € (r, cof, IR")) = 1 


r,@0 


a,ab 
T,OQ 


Proof 'The probability measure Q 
that b = 0. 

The function a is progressively continuous. This means that the function a is contin- 
uous. Let a, be the I delayed function defined as a, (u, œw, x) = a(u — L, €, X) for 


is equivalent with Q7 .,,. Thus we can assume 


all u > r++ anda,(u, œ, x) = a(r, œ, x) for all 0 < u < r + 1. The function an 
is also progressively continuous. Given n, let tý be an increasing sequence such that 
tọ =r and [tg ,» — tl < i, 

On a Polish space for every subsigma algebra of the Borel sigma algebra there 
exists a regular conditional probability distribution. It follows from [22] and the 
uniqueness of the solution for 7° starting from w at time f”, that for all b. 


Qr oE) = Eom, (§|Bx)(@) for QF", almost all w. Let a, 4 (u, x) = T, (u, o, x) 
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Let Ax = (o, Oy gal € '& (It, t5, D]. Given œw the function an,w is not path- 


dependent. It follows then from [11] that Ons ® is supported by paths continuous on 


[t oo[. We remark that a, (u, o, x) = Ge for alli? < u < [oio and all o 
such that c (u) = w(u) for all u < t. It follows that Q (Ak) = | for all œ. We 
E. 


deduce by induction that QF". ({a’, Ot, cof € € (ir, oo[, IR")) = 1. 

The a, being uniformly bounded, for given r and wo, the set of probabilty mea- 
sures (Q?,,, n € N*} is weakly relatively compact. There is a subsequence weakly 
converging to a probability measure Q. From the continuity assumption on a, it fol- 
lows that Q solves the martingale problem for .Z/^? starting from «o at time r. The 
uniqueness of the solution to this martingale problem implies that Q = Q% o. The 
set {a’, rr col € €([r, oo[, JR")) is a closed subset of 2. It follows from 
the Portmanteau Theorem, see e.g. [2] Theorem 2.1, that Q 
€ (Er, cof, IR")} = 1 


ro b^ d Ot. ool € 


Corollary 1 For all continuous path wo and all r, the support of the probability 


measure pru is contained in the set of continuous paths: 


Qro C (UR+, IR")) = 1 


Remark 1 In the simpler case where the function a is only defined on the set of con- 
tinuous paths, the continuity hypothesis is just the usual continuity hypothesis for a 
function defined on IR} x € ([R,, IR")) for the uniform norm topology. The asso- 
ciated martingale problem: probability measure solution to the martingale problem 
for .Z/^Ó starting from wọ at time r can only be stated for initial continuous paths wọ 
(otherwise the path-dependent function a(u, œw) should be defined for paths w which 
can have jumps before time u). 


4 Stable Set of Probability Measures Solution to a 
Path-dependent Martingale Problem 


In all the paper £2 denotes the set of càdlàg paths endowed with the Skorokhod topol- 
ogy. From now on, a(s, œw) is a given progressively continuous function on IR} x «2 
with values in non negative matrices. We assume that a is bounded and that a(s, w) 
is invertible for all (s, œw). The explicit construction of dynamic risk measures devel- 
oped here, making use of probability measures solutions to a martingale problem 
was first initiated in the unpublished preprint [4] in the Markovian case. We have 
introduced in [5] a general method to construct time consistent convex dynamic risk 
measures. This construction makes use of two tools. The first one is a set 2 of equiv- 
alent probability measures stable by composition and stable by bifurcation (cf. [5] 
Definition 4.1). The second one consists in penalties a; ;(Q), s < t defined for every 
probability measure Q in .Z, satisfying the local condition and the cocycle condition. 
The corresponding definitions are recalled in the Appendix. 
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4.1 Multivalued Mapping and Continuous Selector 


Definition 6 X denotes the quotient of IR} x 12 x IR" by the equivalence relation 
~: (t, 0, x) ~ (t, o, x) i£ t = t,x = x’ and o(u) = w'(u) Vu < t. The metric 
topology on X is induced by the one to one map from X into a subset of IR} x 2 x IR”: 
(t£, 0, x) > (t, cx x, x), 

where o **; x has been defined in Sect. 2.2 Eq. (3). 


The following observation is straightforward. 


Remark 2 The set X is equipped with the metric topology defined above. Then every 
progressively continuous map on IR} x £2 defines a unique continuous map on X. 
Furthermore every map continuous on the subset {(7, w, x), @ = w *, x} of IR} x 
Q x IR" defines also a unique continuous map on X. 


Recall now the definition of a multivalued mapping from X to Y. We use here the 
terminology chosen in [9]. Notice that the terminology used in [1] for multivalued 
mapping is correspondence. 


Definition 7 A multivalued mapping A from X into JR” is a map A defined on X 
such that for all (t, o, x) € X, A(t, œ, x) is a subset of JR”. It can have additional 
properties: 


1. Ais convex if V(t, w, x) € X, A(t, œ, x) is a convex subset of JR”. 
2. Ais closed if for all (t, œw, x), A(t, œw, x) is closed. 


Recall the following definition of a continuous selector (Definition 16.57 of [1]). 


Definition 8 A selector from a multivalued mapping A from X into /R" is a function 
s: X — IR" such that s(t, o, x) € A(t, c, x) for all (t, œw, x) € X. A continuous 
selector is a selector which is continuous. 


Recall the following definition from [1] (Definition 16.2 and Lemma 16.5): 


Definition 9 A multivalued mapping A from X into JR” is lower hemicontinuous if 
it satisfies the following equivalent conditions 


e For every closed subset F of IR”, A"(F) = {(t,@,x) € X : A(t, eo, x) C F} is 
closed 

e For every open subset V of IR", A'(V) = {(t,@,x) € X : A(t, w, x) QV zx fj is 
open 


Recall the following Michael Selection Theorem (cf. [1] Theorem 16.61) 


Theorem 2 A lower hemicontinuous mapping from a paracompact space into a 
Banach space with non empty closed convex values admits a continuous selector. 


Recall also that every metrizable space is paracompact (Theorem 2.86 of [1]). 


158 J. Bion-Nadal 


4.2 Stable Set of Probability Measures Associated to a 
Multivalued Mapping 


Inall the following, A is aclosed convex lower hemicontinuous multivalued mapping 
from X into JR”. In the following (X;) denotes the canonical process on 2 : For all 
càdlàg path w, X;(w) = w(t). (F;) is the canonical filtration. 

Given the progressively continuous matrix valued map a, given r > 0, and w € 2, 
we want to associate to A a stable set 2,.,.(A) of probability measures on (2, 2) 
all equivalent with the probability measure Q7 ,, on %,. Furthermore we want to 
construct a continuous function v on X. Therefore we start with continuous selectors 
à from A. 


Definition 10 Let a be progressively continuous bounded defined on JR x £2 with 
values in non negative matrices, such that a(t, w) is invertible for all (t, w). Let A be 
a closed convex lower hemicontinuous multivalued mapping from X into JR”. 


e Wedefine L(A) to be the set of continuous bounded selectors from the multivalued 
mapping A. 

e For given r > 0 and o € €2, the set 2, XA) is the stable set of probability 
measures generated by the probability measures OF. X € L(A) with A(t, œ") = 
X(t, o , X;(w’)) 


a,ar 
dOr w 


E 


T T 
Br = ext f (A(t, o), dX) — J (A(t, ^), a(t, wo )A(t, œ) dt] 


We give now a description of the set 2, JA). 


Definition 11 We define L(A) to be the set of processes H such that there is a finite 
subdivision 0 = so < = < sj < siz1:: < sy < co. There is a continuous selector 
ho, i; in L(A). And for all 0 < i < k there is a finite partition (A; j)jer; of 2 into Bs, 
measurable sets, and continuous selectors Aij in L(A) such that 


Vsi < U < Si41, Vo! € 2, E(u, wo’, x) = S xga. o , x) la, (o) 
jeli 

Vs, < u Vo! € Q, Tu, o, x) = S ju, w, X) l4; (o) 
jelk 


Vu < so, Vo! € Q, Blu, o', x) = Aoig(u, 0, x) — (13) 


Remark 3 Every process ji in L(A) is bounded strictly progressive and Y x A(R") 
measurable where 4 is the predictable sigma algebra. However there is no uniform 
bound. 
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Proposition 2 /. Leta be as above and tt € L(A). For all r > 0 and all w € Q2, 
there is a unique solution to the martingale problem for 2°“ starting from 
o at time r with (u, œw) = plu, w, o(u)). Furthermore for allr < s, the 
map o! > Q is Bs measurable and is a regular conditional probability 
distribution of Q4. given Bs. 

2. Given 0 < r, the set 2, (A) is the set of all probability measures QF'& for 
some process ji belonging to L(A). 


Proof Let u € L(A). There is a finite subdivision 0 = sọ < --- < Si < Sj41 < 
+++, < oo such that à is described by Eq.(13). Let r and w. We prove first by 
induction on k that there is a unique solution Q to the martingale problem for 2%“ 
starting from o at time r and that Q belongs to 2, XA). 

For k — 0 the result is true by hypothesis. 


Inductive step: Assume that k > 1 and that the result is proved for k — 1. Let Q bea 
solution to the martingale problem for 7“ starting from c at time r. Let Qs, œ bea 
regular conditional probability distribution of Q given Z;,. From [22] it follows that 
for Q almost all w’ in A, j, Qs, 0’ is a solution to the martingale problem for . Z^ ej 
starting from o/ at time sz. The martingale problem for ./^^^*j is well posed. Let 


^"^ be the unique solution to the martingale problem for ./^^*i starting from 
q gale p g 


Sk. C) 
on Ax j Q a.s. Thus for all £, 


: aA j 
c at time sz. It follows that Os, w = ov 


Sk C 


Eg $15) o) = 2, la,; (v) (&) 


jelk 
Eg |B) = M lA, E pins GEM) (14) 
jek rol 


On the other hand the restriction of Q to Zs, is a solution to the martingale problem for 
L4 where v € L(A) is associated to the subdivision (s;)9<;<% and v coincides 
with u on Zi, . From the induction hypothesis it follows that the restriction of Q to 
Bs, is uniquely determined, it coincides with Q7" and it belongs to 2, XA). 


The end of the proof of the inductive step follows then from Eq. (14), from the Zs, 
Mets (€) and from the definition of 2, (A). 
On the other hand it is easy to verify that the set [or 7" : HE L(A)} is stable. 


measurability of the map w’ — Q 


5 Construction of Penalties 


In the preceding section we have constructed for all given (r, c») a stable set of 
probability measures Eo M (A) associated to a multivalued mapping A. In this section 
we construct penalties o/;(Q) for all r < s < t and all Q € 2, SCA) making use of 
a function g. 
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Let g : IR* x 2 x IR" — IRU{+00} bea progressively measurable function. Let A be 
a closed convex multivalued Borel mapping such that for all (t, w, x) € IR* x Q2 x IR", 
(0) C A(t, o, x) C {y € IR"| g(t, v x, x, y) < oo]. Define f as follows: 


Vz e IR? f(t,o,z)— sup (—z.y — g(t, œw, y)) (15) 
ye A(t,w,X;(@)) 


The following lemma is straightforward: 


Lemma 1 For all (t, w), f(t, w, .) is a closed convex function which is the dual 
transform of the function g(t, w, .) where 


g(t, w, y) = g(t, o, y) if y € Alt, œ, Xi(o») 
= +00 else (16) 


For every (t, w) dom(g(t, œw, .)) = A(t, w, X;(@)) 

If g(t, w, 0) = 0 Y(t, w), f takes values in [0, cc]. 

If g takes values in [0, oo] and satisfies V(t, œ), infye A(t,w,X(o))8(t, œw, y) = 0 then 
for all (t, w), f (t, w, 0) = 0. 


Notice that, since A is aclosed convex multivalued mapping, replacing g by g, one 
can always assume that for all (t, œ), dom(g(t, œ, .) = {y € IR? |g(t, @, y) < oo] is 
closed, convex and equal to A(t, w, X;(@)). We assume this in all the remainder. 


Definition 12 1. g satisfies the following polynomial growth condition (GC1) if 
there is K > 0, m € N* and e > O such that 


Vy € Alu, o, X,(0)), |g, œ, y)| < K(1 + sup [Xs(@)|)" + Ilyll?-*) 07) 


2. g satisfies the growth condition (GC2) if there is K > 0 such that 
Vy € Alu, œ, Xu()), [gu 0, y)| < KA + lly) (18) 


Recall the following definition of BMO processes. 


Definition 13 Let C > 0. Let P be a probability measure. A progressively measur- 
able process u belongs to BMO(P) and has a BMO norm less or equal to C if for all 
stopping times r, 


oo 
EM Weleda) = c 
T 
Recall also from [15] that the stochastic exponential & (u) of a BMO process u 


is uniformly integrable, and that the BMO norms with respect to P and P(&(p).) 
are equivalent. Also from [15], for all C > 0 there is 1 < po < oo such that for all 
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BMO(P) process u with ||u||gmor) < C, the stochastic exponential & (jz) satisfies 
the reverse Hólder inequality: 


[EPE (PIZO < Kc& Q0; (19) 


Definition 14 Assume that g is non negative or satisfies one of the growth conditions 
(GC1) or (GC2). Let 0 < r < T. For all BMO(Q7 ,) process u A-valued, for all 
r <s <t <T, define the penalty a, ADI ) as follows 


as, (On) = Egea qd glu, œ, y(u, w))du| Bs) (20) 


We need to verify that the penalties are well defined for all BMO processes and that 
they satisfy the local property and the cocycle condition. (Definition introduced in 
[5], Definition 4.3 and recalled in the Appendix). 


Proposition 3 Assume that the process u belongs to BMO(Q; „). Let C such that 
IlIlamovgs,,) S C. 


el. Assume that g satisfies the growth condition (GC1). Then Eq.(20) defines a 
random variable in Ly, (Q7 „) for all 1 < p < oo, and for given p, the L,(Q7 w) 
norms of es (Qro) are uniformly bounded forr < s € t < T, for all u such 
that llallgmoos, <C. 

Qs, (Oa p Hy belongs also to Li (OF EA and the Lı (Q7: E) norms of ds, (OF o "n Hy 
are uniformly bounded for ||u||guoigs,) < Candr <s E t <T. 

2. Assume that g satisfies (GC2), then the random variables as, Dr 2) belong to 
Loo (Or o) and are uniformly bounded for II lBmovs ,,) <Candr<s<t< 
T. 

3. In case g is non negative, Eq. (20) defines a non negative Z; random variable. 

e Assume that g satisfies the growth condition (GC1) or (GC2). Then the penalty 

defined in (20) satisfies the cocycle condition for every Qro : Letr «stu 


s (Qo) = esi (QT) + Ege Qt ulr o NZBs) Q1) 


e The penalty defined in (20) is local on P(A) 
e If g(t,@,0) = 0 V(t,@) € IR* x Q, The probability measure OF o has zero 
penalty. 


Proof 1. Assume that the function g satisfies the growth condition (GC1). Without 
loss of generality one can assume that m > 2. Choose p; > | such that (2 — 
€)p1 = 2. Let q be the conjugate exponent of pı. 
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It follows from the conditional Hólder inequality and the equivalence of the 
BMO norms with respect to Q7 „ and Qro. that 


Ru J (1 + sup I1Xs D" lu o) dul.) 


< Ki CP [Ege C sup (1+ Iz" DL] ( — s) Q2) 


sx s^ <u 


Let po be such that Eq. (19) is satisfied for P = Q7 ,,. Let qo be the conjugate 
exponent of po. It follows from conditional Hólder inequality and (19) that 


Egg (sup + XS ID77]122,)) S KcEos,, C sup (1+ [Xs ID | Bs yn 


s<s' <t 


< KcEgs ( sup (1+ [IX [n0 |.) a Q3) 


Ss! «t 


for all j > 1. The first assertion of 1 of the proposition follows then from the 
Eqs. (22) and (23) and the inequality Ege (sup,<,<,(1 + |X| 5) < oo for all 
k > 2 ( [16], Chap.2 Sect. 5). m 

The second asssertion of 1. of the proposition follows from Eq. (22) and then 
from Eq. (23) applied with 2, equal to the trivial sigma algebra. 

Assume that g satisfies the the growth condition (GC2). Thus 


les (Q*2^)| < KE ge. ( i! (1+ [ia DI] Bs) 


The result follows then from the BMO condition. 
The case g non negative is trivial. 


The cocycle condition (21) follows easily from the definition (20) and the above 


integrability. 


We prove now that the penalty o is local on 2, w(A). Let p, v € L(A). The 


probability measures Q5 and Qro are equivalent to Qr ,. Letr <s <t 
and A be &,-measurable. Assume that for all X in L? (4,), Egan (X|45,)14 = 


Ege.» (X|&s)1a. It follows from the equality Sau 1, = œk], and the P x 


E(ap)s ^ ^. (av) 


AIR") measurability of ji and v that 1ys.¢law = Lijs silav Q7 o a.s. From (20) 
we get o, (Ora) 14 = ats. (Ofo ) la. Thus the penalty o is local on 2, oA). 


The last point follows easily from the definition of the penalty. 
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6 Time Consistent Dynamic Risk Measures Associated 
to Path-dependent Martingale Problems 


We change the sign in the classical definition of risk measures in order to avoid the 
minus sign which appears in the time consistency property for usual dynamic risk 
measures. In fact ps;(—X) are “usual” dynamic risk measures. 


6.1 Normalized Time-Consistent Convex Dynamic Risk 
Measures 


Proposition 4 Let 2, (A) be the stable set of probability measures defined in 
Definition 10. Assume that g is non negative, and that for all (u, e), g(u, œ, 0) = 0. 
Letr € s € t. The formula 


pst (Y) = esssupgaane ġ, „o (Eger (Y 4A) — as (QE) (24) 


where a (Orn) is given by Eq.(20) defines a normalized time consistent convex 
dynamic risk measure on L?? (Q, B, OF o): 

For given 0 < r < t and Y in LY (Q", B, y the process (05:7 Y))r<s<t admits 
a càdlàg version. 


Proof Notice that for all bounded Y, 
-IlYlloo < Eon, (YI) < prz (Y) < esssupgaare g, „o (Eqs (YI) < IY loo 


Thus for all r < s < t, ||o;7 (Y)lloo < |IY|loc. The first statement follows then 
from Definition 10, from Propositions 2 and 3 and from Theorem 4.4 of [5]. 

The proof of the regularity of paths which was given in [6] Theorem3 for normalized 
convex dynamic risk measures time consistent for stopping times can be extended 
to normalized convex dynamic risk measures time consistent for deterministic 
times. 


We have the following extension of the dynamic risk measure to random variables 
essentially bounded from below: 


Corollary 2 The definition of p; (Y) can be extended to random variables Y (¥;)- 
measurable which are only essentially bounded from below. 

psr LE) = limn—co py Y ^ n). For every Y essentially bounded from below, the 
process Gyr (Y)) is optional. 


Proof Let Y be &;-measurable and Q7 ,,-essentially bounded from below, Y is the 
increasing limit of Y, = Y ^n as n tends to oo. Define rd (Y) as the increasing limit 
of py? (Yn). As we already know that for given s and t, pj? (Y) defined on bounded 
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random variables by formula (24) is continuous from below, the extended definition 
coincides with the previous one on Q7 ,,-essentially bounded random variables. 

From Proposition 4 for every n one can choose a càdlàg version of the process 
051 (Yn). Thus the map (s, œ) > ppf (Y) = lim oj? (Yn) is measurable for the 
optional sigma algebra. 


6.2 General Time-Consistent Convex Dynamic Risk Measures 


In this section the function g (and thus the penalty) is not assumed to be non negative. 


Definition 15 Let Q be a probability measure on (£2, (Z);em, ). The multivalued 
mapping A is BMO(Q) if there is a map $ € BMO(Q) such that 


V(u, œ), suptllyll. y € Au, w)} < bu, w) 


In the following, po is chosen such that the reverse Hölder inequality (19) is satisfied 
for ao. 


Theorem 3 Let (r,@). Assume that the multivalued set A is BMO(Q; ,). Let 


2, (A) be the stable set of probability measures defined in Definition 10. 
Let r € s < t. Let 


pst (Y) = esssupgsanc ġ, „o Eora (Y 4A.) — o (QE) Q5) 


where asi (Ora) is given by Eq.(20) 


e Assume that g satisfies the growth condition (GC1). The above Eq. (25) defines a 
dynamic risk measure (os) on Ly (OF is (46,)) for allqo € p < co, (where qo is 
the conjugate exponent of po chosen as above). These dynamic risk measures are 
time consistent for stopping times taking a finite number of values. 

e Assume that g satisfies the growth condition (GC2). The above Eq.(25) defines 
a dynamic risk measure (p57) on Loo (OF ui (2,)) , and also on every Ly(Q* o» 
(45,)) for qo < p < œ. These dynamic risk measures are time consistent for 
stopping times taking a finite number of values. 


Proof Thereisaconstant C > Osuchthatforall Q77^ € 2, oA), lau auo, € 
C. It follows from the reverse Hólder inequality (19), that for all non negative measur- 
1 


able Y, Eran (Y|Bs) < Kc(Egs QIY L| 24,) ®. Thus Y — Egan (Y|&s) defines 
a linear continuous map on Lj (Q7 „) with values Lp (Qe w) for all qo < p < oo, and 
that for given p, the norms of these linear maps are uniformly bounded for A valued. 
From Proposition 3, it follows then that Eq. (25) defines a dynamic risk measure 
(ost) on Ly(Q£ o» (F:)) for all go < p < oo in case (GC1). Under assumption 
(GC2), equation (25) defines a dynamic risk measure E ) on Lp (QF, o: C) for 
all Lp, qo < p < oo. 
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The time consistency for stopping times taking a finite number of values follows 
from the stability property of the set of probability measures as well as the cocycle 
and local property of the penalties (cf. [5] in Loo case). The proof is the same in Lp 
case. 


7 Strong Feller Property 
7.1 Feller Property for Continuous Parameters 


We assume that the progressively measurable function g is a Caratheodory function 
on IR} x Q x IR", that is for all u, (w, x) — g(u, œw, x) is continuous. The support 
of OFF is contained in the paths w’ continuous on [r, co[ (Proposition 1) and which 
coincide with w on [0, r]. It follows that for every function A progressively continuous 
bounded, and all u > r, the function w > g(u, œw, A(u, «»)) is continuous on the 
support of Deo. We prove then the following Feller property for the penalty. 
Proposition 5 Let a be progressively continuous bounded such that a(s, œw) is invert- 
ible for all (s, w). Let X be progressively continuous bounded. Assume that g is a real 
valued Caratheodory function satisfying the growth condition (GC1) or (GC2). 


1. There isa strictly progressive real valued map L(g) on [O, t] x 2 x IR" continuous 
on ((s, œw, x), @ = w *s x) such that 


t 
Egua / gu, o, A(u, e )du) = L(g)(s,@,x) Vsxt wandx (26) 
SW SX Jf. 


2. Forall0Q <r < s < t, and œw € Q, there isa 22 -null set N such that for all 
w € NC, 


t 
Eo qd gu, o, Au, w'))du| Bs) (o1) = L(g)(s, i, Xs (o1) (27) 


Proof We only need to prove 1. 

Step 1: Assume that the function g is bounded. Then fi glu, wo’, A(u, w’))du is a 
continuous bounded function of o on the support of Que. The map A being pro- 
gressively continuous bounded, the continuity property for L(g) follows easily from 
Theorem 1. 

Step 2: general case. Notice that A is bounded. 

Thus under assumption (GC2), g(u, w, A(u, œ) is uniformly bounded and the con- 
tinuity property of L(g) follows from step 1. 
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Under assumption (GC1), let (sn, @n, Xn), On = On *s, Xn With limit (s, c, x), œ = 
w *, x. By definition the sequence wn *s, Xn has limit w *,s x. It follows from [3] that 
the set of probability measures 2 = (Q7, x, x, n € N} U {Qh x} is weakly 
relatively compact and thus tight. Thus for all n> 0, there is a compact set .# such 
that Q(£^) < n for all Q € 2. From the growth condition (GC1), the existence 
of a uniform bound for Eol f; a + SUPs<u \|X,||"")du]* for Q € 2 and the Hólder 
inequality, it follows that there is a progressively continuous bounded function gi 
such that for all Q in .Z, 


t 
Eo | (lgu, w, (aX) (u, w) — gı u, w)||))du < e. (28) 


The result follows then from step 1. 


We introduce now a class of ZZ, measurable functions on 2 satisfying a continuity 
condition derived from the progressive continuity condition that we have introduced 
for progressive functions and from the continuity property proved in Theorem 1. 


Definition 16 Let ¢ > 0. The function h defined on 2 belongs to @; if there is a 
function h on Q x IR” such that 


e hw) = h(w x, X, (0), X;(@)) 
e h(w, x) = h(a’, x) if w(u) = w (u) Vu «t 


and such that / is continuous bounded on {(@, x), o = o x) C 2 x IR" 


Corollary 3 Assume that a is progressively continuous bounded and that a(s, œw) 
is invertible for all (s, w). Let à be progressively continuous bounded. Leth € @;. 
Asssume that the penalty as; is given by Eq. (20) for some Caratheodory function g 
on IR} x Q x IR" satifying the growth condition (GC1) or (GC2). There is a strictly 
progressive map L®*(h) continuous on ((u, œ *, x, x), 0 € u < t} such that for all 
O<m<rs<t 


L^^(h)(t, w, x) = h(w, x) 
L'^(hy(r, c, x) = Eggo (h) — er (rios, s) 


TO EX 


= [Egga (a Br) — arx) Ono as. — Q9) 


Proof 'The result follows from Theorem 1 and from Proposition 5. 


7.2 Feller Property for the Dynamic Risk Measure 


Proposition 6 Let n in L(A) (Definition 11). Lett > 0 and h € ©. Asssume that the 
penalty as is given by Eq. (20) for some Caratheodory function g on IR} x Q x IR" 
satifying the growth condition (GC1) or (GC2). There is a strictly progressive map 
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®,,(h) continuous on ((u, œ, x), © = œ xu x, u x t} such that 6, (h)(t, œ, x) = 
h(@, x) for all w, and such that for all r < s < t, there is a process Vs in L(A) such 
that 


By (h) Gs, to, x) = Egan (h) — as (Qf (30) 
Eggs (MBs) — e. (Qoa Yo!) = iU) 0, X(')) Qu as. QD 


and 
Eggs (h| Bs) — ds (QR) < Egeas (h| Bs) — as (OEE) Qiu as. — 92) 


Proof The proof is done in two steps. The first step is the construction of ®, (/1) 
given u. The second step is the construction of v, given u and s. u belongs to L(A). 
From Definition 11, let 0 = so < = < Si < Sj41-- < Sk < Spyz] = & be a finite 
partition such that Eq. (13) is satisfied. 


e First step: construction of ®,,(h). We construct ®, (h) recursively on [s;, s;4.1[. 
Let s, be such that s, < t < Sn+1. From Corollary 3, for all j € Z, there is a 
strictly progressive map L®^nj (h) continuous on {(u, w, x), @ = @ *, x, u < t] 
satisfying Eq. (29). Let 

o, (Ih) s, v, x) = sup L^^ (h)(s, e, x) Vs € sy. t (33) 

jeln 
Let h,(o) = ,(h)(s,, c, Xs, (v)). The function hy belongs to Gs, and 
h (w, x) = d, (h) (Sn, o, x). Then we can proceed on [s,—1, Snl with An. We con- 
struct recursively the strictly progressive map 9, (h) continuous on ((u, œw, x), @ = 


w xu X, u X t}. Notice that for all s € [s;, si+1], there are ZZ, measurable sets 
(Cs jjer, such that 


o, (h)(s, w, x) = y» les (œ) LO (hi+1)(s, o, x) (34) 
Jeli 


Second step: Given s € [r, t], construction of the process v,. 
There is a unique k such that s €]s,, 5,41]. For i > k for all u €]s;, s;+1], define 


vs(u, œ) = M Ic, (0) ju, ©) (35) 


jel 


And for u €]s, 5x44], define 


vs(u, ©) = M 1c, (@)Anj(U, ©) (36) 


jelk 


168 J. Bion-Nadal 
and v,(u, œw) = ju (u, w) for all 0 < u < s. It follows from the construction of v; 
that the process v; belongs to L(A). It follows also recursively that for all i > k 
and o € 2, 

Ego (h) — as, (QS) = Pu lh) i, v, Xs (0) (37) 
and for all w € 2, 
Egeas (h) — ds, (Oso) = 4), w, Xs()) (38) 
By construction the following inequality is satisfied: 
Egan (h) — o (Q5) < d, D), o, Xs(@)) (39) 
It follows then from Proposition 2 that 


Egra (h| Bs) — os (Qro) S Eases UBs) — as, Qrig) Qr us A-S. 


and 


[Eggs (h| Bs) — osi (Qo) (o) = d DG, 0, oC) Qf, as. 


Theorem 4 Assume that the hypothesis of Theorem3 are satisfied and that g 
is Caratheodary function. The time consistent dynamic risk measure s. \r<s<t 
defined on LY (R, B, Q7 ) by Eq. (25) satisfies the following Feller property: For 
every function h € G, there is a progressive map R(h) on IR} x 2, R(h)(t, w) = h(a), 
such that R(h) is lower semi continuous on ((u, œ, x), u € t, œ = @*,x} and such 
that the following equation is satisfied 


Vs e [r,t], Yo! € 2, pÈ? (h) = RO), o/) (40) 
VO<r<s<t, Pst (h)(a’) = R(h)(s, w’, o (s)) Os as. (41) 


(R(h) denotes the strictly progressive map on IRs x Q x IR" associated to R(h) in 
the one to one corrrespondence introduced in Sect. 2). 


Proof For all u € L(A), let ®,,(h) be the strictly progressive map, continuous 
on ((u,c0,x), o = oy x, u < t} constructed in Proposition 6. Let R(h) = 
SUP ez (A) @,,(h). The function R(h) is then lower semi continuous on {(u, œ, x), @ = 
W*yX, u € t). Let R(h)(u,o) = R(A)(u, o, X,(@)). Equations (40) and (41) follow 
easily from Proposition 6. 
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8 Existence of Viscosity Solutions for Path-dependent PDEs 


6.1 Existence of Viscosity Supersolutions 


Recall that A is a closed convex lower hemicontinuous multivalued mapping from 
X into /R" (Sect. 4.2). Let f be the convex conjugate of g defined as 


f(o,z)-— sup (z*y— g(u,@, y)) (42) 


yeA(u,o) 


We prove now that for all h € @ the map R(h) of Theorem4 leads to viscosity 
solutions for the following semi-linear second order PDE. 


—d,v(u, w) — .Z'v(u, w) — f (u, w, a(u, e) Dyv(u, @)) = 0 
v(t, w) = f (o) (43) 
Lv(u, w) = 3Tr(a(u, o)D? (v)(u, @)) 


Theorem 5 Fix (to, wo). Assume that the mutivalued set A is BMO(Qy, wy) (Def- 
inition 15). Assume that the function g satisfies the preceding hypothesis (g is 
a Caratheodory function and satisfies the growth condition (GCI) or (GC2)). 
Assume furthermore that g is upper semicontinuous on ((s, œw, y), (s,o) E X, y € 
A(s, e, w(s)}. For all r and a, let (p. be the dynamic risk measure given by 
Eq. (24) where the penalty satisfies Eq. (20). 

Let h € G;. The function R(h) is progressive and R(h) is lower semi continuous on 
(Q4, w, x), o = oy x, u < t} (Theorem4). R(h) is a viscosity supersolution of the 
path-dependent second order partial differential equation (43) at each point (to, wo) 
such that f (to, wo, a(to, «o)z) is finite for all z. 


Proof Let xo = wo(to). From Theorem4, the function R(h) is progressive and R(h) 
is lower semi continuous on ((u, w, x), @ = @*,x, u < t}. We prove first that R(h) 
is bounded from below on some D; (fo, o). R(h) (u, w) = Ega | (h) — onu (Qio). For 


all given k > 2, Egs , (Sup; «(1l + X105) is uniformly bounded for (u, œw) € 
D, (to, wo). The result follows then from either the (GC1) condition or the (GC2). 
Let $ progressive, $ in EPOR} x £2 x IR") such that ¢ (to, wo) = R(h) (to, wo) 
and (to, wo) is a local minimizer of R(h) — $ on D, (to, wo) for some 7 > 0. 


e Step 1: Continuity properties 
By hypothesis f (fo, wo, a(to, 0) Dx$ (fo, wo, xo)) < oo. Thus for all ¢ > 0, there 
is Ag € A(to, wo) such that 


DG (to, wo, xo)'a(to. 9)À9 — (to. wo, A9) > f (to, wo, alto, o) Dx (to, wo, xo)) — € 
(44) 

The multivalued set A is assumed to be lower hemicontinuous. It follows that 

for all K > 0, Ax is also lower hemicontinuous, where Ak(u,co,x) = {z € 
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A(u, w, x), ||z|| < K}. Choose K > ||Ao||. From Theorem 2, there is thus a contin- 
uous bounded selector A (u, w, x) of Ax defined on X such that A(to, wo, xo) = Ao. 
From the upper semi-continuity condition satisfied by g, and the continuity of the 
map A, it follows that for all € > 0, there is 7} > 0, such that for f < u < t < 
to + m1, d(@, c) < nı and ||o(t) — xoll < nı, 


g(t, w, A(t, œ, w(t))) — g(to, o. Xo) «€ (45) 


From the continuity of the function A, the hypothesis $ € gl 9? and the progres- 
sive continuity of a, there is n2 such that for tọ < u < t < to + m, ||x — xoll < mo 
and d(@ *, x, v) < N2, 


laud (u, w, x) + 5 Trace D35alu, w, x)) + (Drp aXX, o, x)— 
E 1 za "T 
up (to, wo, x9) + z Irace(Dz $a) (to. w0, X0)) + Dx ato, wo, xo)àol € € (46) 


The maps a and à are bounded. It follows from [22] that there is 0 < a < 
inf (n, 71, 2) such that 


Qr (A) <e with A — {o | sup — leu) — exto)I| = inf Gi. m. m) ] 
to xusto4- o 
(47) 


Let C = {@, w(u) = wolu) VO < u € to, SUPy<u<n+a leu) — eo(t9)l| < 
inf (n, m. n2)} 

Step 2: Time consistency 

For all 0 < B <a, let ô be the stopping time 6 = f 1c. The stopping time ô takes 
only 2 different values. By definition of the probability measure oss it follows 


from (47) that (d (C) » (1 — &). 
The dynamic risk measure (a aee is time consistent for stopping times 
taking a finite number of values, thus 


Prost”) = Poors Onti) (48) 


From Theorem 4, the lower semi continuous function R(h) satisfies: (p 


hlo) = Rü)(to + ô, w, Xi ca (0)) Qu, aS., and oCh) = R(h)(to, w0). 
Let A be the continuous function defined in step 1. By hypothesis R(h) > $ on 
D, (to, wo). For all o € C and u < P, (u, œ *, c(u)) € Dy(to, wo). Since the 
functions R(h) and ¢ are progressive it follows that R(h) > $ on [fo, to + 8] x C. 
It follows then from the equality $ (to, wo) = R(h)(to, wo) and from the definition 


19,c0 
of Pio t4 that 


to+ô 
$ (fo, wo) = Bopa [I (to + ô, œ, Xip+5(@)) -f gu, w, à (u, w, Xu(w)))du] 
(49) 
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e Step 3: Martingale problem 


The probability measure D is solution to the martingale problem for 74“ 


starting from wo at time fo. The function ¢ is strictly progressive and belongs to 
el 0-2 OR. x §2 x IR"). It follows from the martingale property proved in [3] that 


to+ô 
0 > Egoa if (dup (u, co, Xu(@)) + l Trace(D2 6), w, X,(w))))du] 
19.09 to 


194-8 
+ Equi [ l (DPT o, Xalo) — gu, o, Ta, o, X, (0)))du] (50) 
0:0 Jtg 


Step 4: Conclusion 
Divide (50) by £ and let £ tend to 0. It follows from the definitions of C and 6 and 


the Eqs. (44)-(46) proved in step 1 and Qt (C) > 1 — e that 


— 3u (to, wo, xo) — -Z$ (to, wo, xo) — f (to, wo, o (to, v9) Dx (tg, wo, x9)) = —2&(1—6) 


This gives the result. 


6.2 Existence of Viscosity Subsolutions 


In this section we will assume that the set A has some uniform BMO property. 


Definition 17 The multivalued mapping A is uniformly BMO with respect to a if 
there is a non negative progressively measurable map g and C > 0 such that for all 
O<s, 

sup{|A|, A € A(s, w)} < qs, o) ($1) 


and such that for all (s’, w), the unique solution to the martingale problem .Z/* 
starting from œ’ at time s' satisfies: 


oo 
D (f o(s, @)?ds) < C (52) 
s! 
Of course the above condition is satisfied if sup, i os, c)?ds — oo. 


The name “uniform BMO” property is justifed by the following result. 


Lemma 2 Assume that the multivalued mapping A is uniformly BMO with respect 
to a. Then for all (r, w) and all process u A valued such that ji is A x A(R") 
measurable, u belongs to BMO(Q7 „) and ||| [BMO(Q? ,,) <C 


Proof Let r, œw and a stopping time t > r. It follows from [22] and from uniqueness 
of the solution to the martingale problem for 7° starting from o/ at time t(w’) that 
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for Q7 „ almost all o, 


oo Cc 
Eo | eG dui) = Eo, (i ydo = C 
T TEMPOR t(o) 


Leth € G;. The function R(A) is lower semi continuous in viscosity sense but itis not 
necessarily upper semi continuous. Therefore we need to introduce the upper semi 
continuous envelope of R(h) in the viscosity sense according to Sect. 2.3. Denote it 
R(h)*. 

R(h)*(s, œ) = lim sup{R(h)(s’, o), (s, e) € D,(s, @)} 


n0 


Theorem 6 Let (to, wo) be given. Assume that the mutivalued set A is uniformly 
BMO with respect to a. Assume that the function g is a Caratheodory function sat- 
isfying the growth condition (GC1) or (GC2) and that the Fenchel transform f of g 
is progressively continuous. 

Leth € 6. The map R(h)* is progressive, R(h)* is upper semicontinuous in viscosity 
sense. R(h)* is a viscosity subsolution of the path-dependent second order partial 
differential equation (43). 


Proof Let x9 = @o(to). The progressivity of R(h)* follows from the equality 
D,(s,@) = Dy(s, c *, w(s)). The upper semicontinuity property follows from 
the definition of R(h)*. We prove first that R(A) is bounded on some D, (fo, wo). 
R(h)(u, œ) = sup, (Ega.au (h) — du, (Qua). For given k > 2, Es, (sup, z,«,(1 + 
IP DED) is uniformly bounded for (u, w) € D;. The result follows then from either 
the (GC1) condition or the (GC2), and from the uniform BMO hypothesis with 
similar arguments as in the proof of Proposition 3. 

Let $ progressive, $ € [An ?R, x Q x IR") such that $ (to, wo) = R(h)* (to, wo) 
and such that (to, co) is a maximizer of R(h)* — $ on Dy (to, wo). 


e Step 1: Making use of the progressive continuity property of a, f and of the 


regularity of 9. for all n € N*, there is y, > 0, fo + 9, < t such that for 
to < u X to + Mn, d(@o, ©) < Ny and ||xo — e (u)|| < Nn, 


= = 1 
FU, c, (4Dxb)(u, w, co(u))) < f (fo, wo, (4Dx) (to, wo, xo)) + — (53) 


loud (u, w, w(u)) + Trace D2Ga) (u, w, w(u)))— 


= 1 = 1 
ud (to, coo, xo) + 5 Trace(D; $3) (fo, wo, x0))| < — (54) 
Without loss of generality one can assume m, < n. 
The matrix valued process a being bounded, it follows from [22] that there is hy, 
such that for all fj € s < to + n and w € D,(to, wo), 
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OS, sup |o) — e5)ll > T» «€ (55) 


s<u<sthy 


Without loss of generality one can assume that hy, < T. 
For all n > 0 choose (tn, c) € Dm (to, wo) such that lim, oo R(A) (t4, @n) = 


R(h)* (to, c9) and 


h 
$ (tns On) < RUD tns on) + — (56) 
Let C, — {o’, SUD; <u<ty thn Iw’ (u) — On(tn)|| < T. e (v) = ov), Vv < th}. 
It follows from Eq. (55) that OF ne (Cn) > 1— & forall n. From Lemma 2 there is a 
constant C’ > 0 such that ||au||gyo(g« „) < C' forall u € L(A) and all n. From 
[15] it follows that there is po such that the reverse Hólder inequality (19) is satisfied 


" L 
for all £ (ap) and thus that for all u € L(A), and all n, Q7 7 (CS) < Ke(e)% 
where qo is the conjugate exponent of po. The constants qo and K, depend neither 
on u nor on n. Thus & can be chosen such that 0r. (Ca) > 5 for all n and all 
u € L(A). 

Let ôn be the stopping time taking only two values h, and 0 defined by ôn = hn1c,- 
Step 2: Time consistency 
Making use of the time consistency of the risk measure (or ^), and of the fol- 
lowing equations deduced from Theorem 4, 

RA) (tn, On) = pry" (h) 


tn, 


pee O= ROD s in) OF p aas. 


it follows that 
ROM) (ts. On) = prt, s (RA) (y + ôn, -)) 


tn,@n 


It follows from the definition of the risk measure (oy NS 


), that for all n there is 
a process up in LCA) such that 


h 
RM) (ts, On) < Egaeun (RO) (tn + n. -)) — 0t, 4, Qf, ) + Ps (57) 


For all » € Cn, (tn + ha, e *t +h, o (tn + An)) € Ds (to, wo). The functions R(h) 
and ¢ are progressive and satisfy R(h) < R(h)* < $ on Dy (to, wo). It follows that 


R(h) (t + ôn, 0) < d (t, + ôn, 9) Vo" € C, (58) 


From Eqs. (56)-(58), it follows that 


tn+ôn h 
P (tn, On) < Egsenn lo (tn + ôn, -) -f gU, o, un (u, o ))du] + 2 (59) 
n.n t 


n 
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e Step 3: Martingale problem 
Given (tn, c), the probability measure p " is solution to the martingale prob- 


lem 7%" starting from cy at time tj. The strictly progressive function $ belongs 
to B>? It follows from [3] and from Eq. (59) that 


tntôn 1 D 
0 < Egaem| (9, (u, w, (w (u))) + 5 lrace(Dia)(u, c , w (u)))du] 
nn tn 


tn+ôn e h 
+ Egi f (Drp Qu, o, e) alu, oun Qu, o") — glu, o, unlu, o) du] + 2 
n.n tn 
(60) 


By definition of f it follows that 


tn+ôn 
0< E ann if (8, Qu, w, e» (u))) + ; Trace( (Da) (u, œ, o (u)))du] 
n.On t 


n 


tn+ôn - h 
+ Equann if f(u, o, (aD, 9) (u, wo, w (u)))du] + 2— (61) 
n,n Ín 
e Step 4: Conclusion 
Divide equation (61) by hy and let n tend to oo. The result follows from step 1, 


the inequality Oo "(C4) = 1 for all n and ôn = hylc,,. 


>On 


8.3 Existence of Viscosity Solutions on the Set 
of Continuous Paths 


On the set of continuous paths @ (IR. x IR") we consider the uniform norm topology. 
In this section we assume that the function a is only defined on R4 x (IR, x IR") 
and that it is continuous. For every continuous function A on the space of continuous 
paths (IR, x IR") such that h(w) = h(a’) if co(u) = œ'(u) for all u < t, the 
corresponding function R(h) is constructed as above. In this case the function R(/1) is 
defined only on the set of continuous paths (more precisely on [0, t] x @ (IR, x IR")). 
We make use of the definition of viscosity solution on the set of continuous paths 
introduced in Sect. 2.4 (Definition 4). To prove that R(/1) is a viscosity supersolution, 
to define R(/1)* and prove that it is a viscosity subsolution, we do not need to extend 
the functions a nor R(A). Indeed as the support of the probability measure Q% ,, is 
contained in the set of continuous paths, we just need to use the restrictions of the 
function $ and of its partial derivatives to the set IR} x V (IR. x IR") x IR". The 
proofs given for Theorems 5 and 6 can be easily adapted to prove the analog result 
in the setting of continuous paths. 

Thus the setting of continuous paths can be considered as a "particular case" of the 
setting of càdlàg paths. 


Dynamic Risk Measures and Path-Dependent ... 175 


9 Conclusion and Perspectives 


We have introduced new notions of regular solutions and viscosity solutions for path- 
dependent second order PDEs, Eq. (1), in the setting of càdlàg paths. In line with the 
recent literature on the topic, a solution of (1) must be searched among progressive 
functions, that is path-dependent functions depending at time ¢ on all the path up to 
time t. However, the notions of solutions introduced in the present paper differ from 
previous notions introduced in the literature on two major points: 


e In contrast with other papers, we consider on the set 2 of càdlàg paths the Sko- 
rokhod topology. £2 is thus a Polish space. This property is fundamental for the 
construction of solutions for path-dependent PDE that we give in the present paper. 

e The notions of partial derivative for progressive functions that we introduce are 
defined in a very natural way by considering a progressive function of two variables 
as a function of three variables. 


In addition we introduce also a notion of viscosity solution on the set of continuous 
paths. 

Making use of the martingale problem approach to path-dependent diffusion 
processes, we then construct time consistent dynamic risk measures o;;^. The stable 
set of probability measures used for the construction of o;;^ is a set generated by 
probability measures solution to the path-dependent martingale problem for 2°" 
starting form w at time r. The path-dependent progressively continuous bounded 
function a is given and takes values in the set of invertible non negative matrices. 
The path-dependent functions u are progressively continuous and vary accordingly 
to a multivalued mapping A. This construction is done in a very general setting. In 
particular the coefficients u are not uniformly bounded. We just assume that they 
satisfy some uniform BMO condition. To construct the penalties, we make use of a 
path-dependent function g satisfying some polynomial growth condition with respect 
to the path and some L» condition with respect to the process jz related to the BMO 
condition. In contrast with the usual setting of BSDE, in all this construction no Lip- 
schitz hypothesis are assumed. Notice however that the Lipschitz setting can also be 
studied within our approach: instead of starting with progressively continuous maps 
a and u, one could start from a subfamily of maps which, for example, satisfy some 
uniform continuity condition (as K Lipschitz maps). 

We show that these risk measures provide explicit solutions for semi-linear path- 
dependent PDEs (2). First, we prove that the risk measures p;;” satisfy the following 
Feller property. For every function h Z/, measurable having some continuity property, 
there is a progressively lower semi-continuous function R(/1) such that js (h(X;)) = 
R(h)(r, @) and R(A)(t, œ) = h(t, œw). Next, the function R(h) is proved to be a 
viscosity supersolution for a semi-linear path-dependent PDE (2), where the function 
f itself is associated to g by duality on the multivalued mapping A. We prove also 
that the upper semi continuous envelope of R(/) is a viscosity subsolution for the 
path-dependent semi linear second order PDE (2). 

Here we have proved the progressive lower semi continuity for R(h). To prove the 
progressive continuity property, additional hypothesis should be added, e.g. Lipschitz 
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conditions. Another way of proving the continuity is to apply a comparison Theorem. 
The study of comparison theorems and of continuity properties in this setting, as well 
as the study of solutions to fully non linear path-dependent PDE will be the subject 
of future work. 
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Appendix 


An important way of constructing time consistent dynamic risk measures is to con- 
struct a stable set of equivalent probability measures and to define on this set a penalty 
which is local and satisfies the cocycle condition [5]. Recall the following definitions 


Definition 18 A set 2 of equivalent probability measures on a filtered probability 
space (2, Z, (F;)) is stable if it satisfies the two following properties: 


1. Stability by composition 
For all s > 0 for all Q and R in 2, there is a probability measure S in 2 such 
that for all X bounded 4-measurable, 


Es(X) = Eg(Er(X|4s)) 


2. Stability by bifurcation 
For all s > 0, for all Q and R in 2, for all A € Z,, there is a probability measure 
S in 2 such that for all X bounded Z-measurable, 


Es(X| Fs) = lAEQQX|-7,) + lacEr(X|Fs) 


Definition 19 A penalty function o defined on a stable set 2 of probability measures 
all equivalent is a family of maps (o,;), s < t, defined on 2 with values in the set 
of Z/,-measurable maps such that 


(i) o is local: 
For all Q, R in 2, for all s, for all A in Zs, the assertion l4Eg(X|4;) = 
lAER(X|Z2,) for all X bounded 4, measurable implies that 140;,,(Q) = 
lads (R). 
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(ii) o satisfies the cocycle condition: For all r < s < t, for all Q in 2, 


a, (Q) = a, s(Q) st Eg (As (OIF) 


Recall the following result from [5]. 


Proposition 7 Given a stable set 2 of probability measures and a penalty (o, +) 
defined on 2 satisfying the local property and the cocycle condition, 


Pst (X) = esssupge 2 (EQ(X|.7,) — as (Q)) 


defines a time consistent dynamic risk measure on Lo5(€2 , B, (28,)) or on L,(€2, B, 
(¥;)) if the corresponding integrability conditions are satisfied. 
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Pricing CoCos with a Market Trigger 


José Manuel Corcuera and Arturo Valdivia 


Abstract Contingent Convertible Bonds, or CoCos, are contingent capital instru- 
ments which are converted into shares, or may suffer a principal write-down, if certain 
trigger event occurs. In this paper we discuss some approaches to the problem of pric- 
ing CoCos when its conversion and the other relevant credit events are triggered by 
the issuer's share price. We introduce a new model of partial information which aims 
at enhancing the market trigger approach while remaining analytically tractable. We 
address also CoCos having the additional feature of being callable by the issuer at a 
series of pre-defined dates. These callable CoCos are thus exposed to a new source 
of risk—referred to as extension risk—since they have no fixed maturity, and the 
repayment of the principal may take place at the issuer's convenience. 


Keywords Contingent convertible + Coco bond : Callable bond * Extension risk 


1 Introduction 


The Basel Committee on banking Supervision was created in 1974, after the collapse 
of the German Bank Herstatt, with the aim of establishing prudential rules of trading. 
During the 1980s this committee was concerned with the big moral hazard of Japanese 
banks that distorted the competition among countries. In 1988 it formulated a set of 
rules, so called Basel I, to stabilize the international banking system. Basically the 
main rule was that each bank should hold a minimum of 8 96 of its total assets, where, 
for the valuation of the assets it was used some weights reflecting the credit risk of 
each asset. These measures produced a credit crunch and some criticism appeared, 
mainly related with the weights used to measure the risk of the different assets. 
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These weights only took into account the kind of institution borrowing or issuing the 
security and not what the spreads observed in the market. To amend the first Basel 
accords and take into account the market risk and interest-rate risk, it was started a 
process that concluded in 2004 with new rules, Basel II. These agreements are more 
complex and consider not only new rules for capitalization, with the introduction 
of VaR methodology, but also supervision and transparency rules. The regulator 
calculated the weights on the basis of the formula 


K =LGD x ® ee). | E $-(0.999| — PD x LGD, | (1) 
= x x . E x , 
JI—R 1—R 


where ® is the CDF of the standard normal distribution, LGD is the loss in case 
of default, PD is the probability of default, and R is the correlation between the 
portfolio of loans and a macroeconomic risk factor, see [24] for an explanation of its 
underlying model. To determine the different parameters, banks were allowed to use 
their own models. 

In 2007 a financial crisis, originated in the U.S. home loans market, quickly spread 
to other markets, sectors and countries, forcing the Federal Reserve and the European 
Central Bank to intervene in response to the collapse of the interbank market. This 
gave rise, in 2010, to new regulation rules, known as Basel III, that would change 
the financial landscape. Some securities were not going to be allowed anymore as 
regulatory capital and supervisors put emphasis in the fact that capital regulatory 
should have a real loss absorbing capacity. This is when Contingent Convertibles 
(CoCos) started to play an important role. 

In 2002 Flannery proposed and early form of CoCo that he called Reverse Con- 
vertible Debentures, see [22]. The idea was that whenever the bank issuing such 
debentures reaches a market-based capital ratio that is below a pre-specified level, a 
sufficient number of said debentures would convert into shares at the current market 
price. Later, in [23], he updated the proposal and named these assets as Contingent 
Capital Certificates. The idea behind was in agreement with what [19] wrote in 
The Prudential Regulation of Banks. In this work they formulated the representation 
hypothesis. According to this hypothesis prudential regulation should aim at replicat- 
ing the corporate governance of non-financial firms, that is, acting as a representative 
of the debtholders of bank, regulation should play the role of creditors in nonfinancial 
institutions. 

A Contingent Convertible is a bond issued by a financial institution where, upon 
the appearance of a trigger event, related with a distress of the institution, either an 
automatic conversion into a predetermined number of shares takes place or a partial 
write-down of the bond's face value is applied. It is intended to be a loss absorbing 
security in the sense that in case of liquidity difficulties it produces a recapitalization 
of the entity. 

Basel III, among other regulating measures, proposed the inclusion of CoCos as 
part of Additional Tier 1 Capital, where Tier 1 is, roughly speaking, the capital or the 
assets that the entity have, for sure, in case of crisis, and consists of Common Equity 


Pricing CoCos with a Market Trigger 181 


Tier 1 and Additional Tier 1. Chan and van Wijnbergen [5] affirm that the inclusion 
of Cocos in Tier 1 is a likely factor in the increase of CoCoissuances. In December 
2013, the CoCo market had reached $49 bn in size in Europe. 

Itis a controversial issue if CoCos are a stabilizing security. Koziol and Lawrenz 
[28] show that, under certain modelling assumptions, if CoCos are part of the capital 
structure of the company equity holders can take more risky strategies, trying to 
maximize the value of their shares. In their work Koziol and Lawrenz use a low 
level of the asset price of the company as a trigger for the conversion. Chan and 
van Wijnbergen [5] point out that conversion can be seen as a negative signal by 
the depositors of a bank and to produce bank runs. They also argue that far from 
lowering the risks, CoCos can increase even the systemic risks. On the other hand 
[20] defend that CoCos is an appropriate solution that does not lead to moral hazard 
provided that conversion is tied to exogenous macroeconomic shocks. 

There is also disagreement about how to establish the trigger event. It is perhaps 
the most controversial parameter in a CoCo. Some advocate conversion based on 
book values, like the different capital ratios used in Basel III. Others defend market 
triggers like the market value of the equity. So far the CoCos issued by the private 
sector are based on accounting ratios. 

The market for contingent convertibles started in December 2009 when the Lloyds 
Banking Group launched its $13.7 bn issue of Enhanced Capital Notes. Next in line 
was Rabobank making its first entry in the market for contingent debt with a €1.25 bn 
issue early 2010. After this, things turned quiet until February 2011, when Credit 
Suisse launched its so-called Buffer Capital Notes ($2 bn). This Credit Suisse issue 
was done on the back of the new regulatory regime in Switzerland. This was called 
the “Swiss Finish" and it required the larger banks such as UBS and Credit Suisse 
to hold loss absorbing capital up to 19 % of their risk weighted assets, see [11]. This 
capital had to consist of at least 10% common equity and up to 9 % in contingent 
capital. In 2014 a number of banks issued CoCos, including Deutsche Bank and 
Mizuho Financial Group. 

From a modelling point of view and sometimes depending on the trigger chosen 
for the conversion, usually a low level of a certain index related with the asset, the 
debt or the equity of the firm, one can follow an intensity approach or a structural 
approach to model the trigger. For an intensity approach for modelling the conversion 
time, see for instance [8, 16]. This approach is especially useful when pricing CoCos 
is the main interest, it is a kind of statistical modelling of the trigger event. In fact 
what one models is the law of the conversion time. In the structural approach for 
modelling the trigger, one models the random variable describing the conversion 
time and one relates it with the dynamics of the assets, debt, or equities. It is a more 
explanatory approach, where one can use the observed dynamics of certain economic 
facts to describe the conversion time. 

In the structural approach one can use a market trigger based on a low level of 
the equity value. This approach is very appealing because the market value of equity 
is an observable economic variable whose dynamics can be modelled in order to fit 
historical data. At the same time it allows to obtain close pricing formulas, like in 
[13], and to define an objective trigger that can be observed immediately. Cheridito 
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and Xu [7] also use this trigger and show that pricing and hedging problems can be 
treated for quite general continuous models and barriers and that solutions can always 
be obtained, at least numerically, using Feymann-Kac type results, translating the 
problem of pricing into a problem of solving a series of parabolic partial differential 
equations (PDE) with Dirichlet boundary conditions. 

One argument against accounting triggers is that monitoring is not continuous, 
there is always a delay in the information. Moreover, in the recent crisis these triggers 
did not provide any signal of distress in troubled banks. On the contrary, when using 
market triggers, there exist the risk of market manipulations of the equity price trying 
to force the conversion or undesirable phenomena like the death-spiral effect. In [17] 
authors propose a system of multiple triggers to avoid the death spiral, whereas in 
[13] a system of coupon cancellations is proposed in order to alleviate this effect. 
Sundaresan and Wang [39] analyze this kind of trigger and find that to use low equity 
values as trigger is not innocuous. It can have destabilizing effects in the firm. Their 
reasoning is roughly speaking the following. Suppose that (A;);>9 represents the 
aggregate value of the assets of the company, (D,);>0 its aggregate debt, (C;);-0 
the aggregate value of the CoCos, and (L;);+0 the aggregate liquidation value of the 
CoCos issued by the firm. Set t for the conversion time, and assume that it happens 
when the (aggregate) equity value, (E;);>0 is lower that some level say (H;);>0. Since 
equity value is the residual value of the asset, at any time f < t, we will have two 
possibilities: 

A; — Di — C; > H, ift <rt (2) 


or 
A, — D; — Li < H, ift = T. (3) 
This gives that, at any time f < T, 
Di + Cy < Aj < Di + Li. (4) 


So, if C, > L, there is not any possible value for A; and if C, < L there are 
multiple values, all of these allowing, according to [39], potential price manipulation, 
market uncertainty, inefficient allocation and unreliability of conversion.Obviously 
if C, = L,, that is, if there is no jump on the wealth of CoCo’s investors at the 
conversion time, then the equilibrium is possible, but this is considered non realistic 
and even problematic, since a punitive conversion for shareholders could help to 
maintain the market discipline. As a possible remedy to this situation [32] propose a 
trigger based not only in the value of the equities but also in the value of the CoCos. 
They also propose to include, in the CoCo contract, a PUT option of the issuer on 
the equities, in case of conversion, to avoid market price manipulations. 

Another possibility, in the structural approach, is to use a low value of the asset 
value as a trigger. It is also quite appealing, since it allows to consider the whole 
capital structure of the company, to study the effect of CoCo debt in the equity value 
and to obtain the optimal conversion barrier for the shareholders. See for instance 
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[6, 28] or [2]. In this latter paper they also consider other triggers aiming at providing 
a proxy for regulatory triggers. 

Nevertheless in all these papers authors consider a fixed maturity of the CoCo 
bond. However bonds often do not just have a legal maturity but can have also 
different call dates. In such cases, the bond can be called back by the issuer at these 
dates prior to the legal maturity. This risk of extending the life of a contractis what we 
call extension risk. This has been treated for the first time in [18] using an intensity 
approach and in [14] using a structural approach. 

In this paper we review our work on the topic of pricing CoCos, and we introduce 
new issues like delay in the information and jumps. We always consider low values of 
the stock as triggers and CoCos that convert totally into equities in case of conversion. 
The paper is organized as follows. The contract features are specified in Sect. 2. In 
addition, a model-free formula for the CoCoprice is presented in order to establish the 
general pricing problem. In Sect. 3 a model with stochastic interest rates is studied. 
A closed-form formula for the price is given and subsequently used in order to 
study the Black-Scholes model and its Greeks. In Sect. 4 two advanced models are 
discussed. On the one hand, the stochastic volatility Heston model is incorporated 
to the share price dynamics, and the correspondent prices are later on obtained by a 
PDE approach. On the other hand, an exponential Lévy model is proposed, and the 
obtainment of the correspondent prices is addressed by a Fourier method exploiting 
the so-called Wiener-Hopf factorization for Lévy processes. In Sect. 5 we introduce 
a new trigger model which aims to describe the delay of information present in 
accounting triggers. Finally, in Sect. 6 we show how the original pricing problem is 
modified when no fixed maturity is imposed on the CoCo. This variation leads to 
what we call CoCos with extension risk, and the pricing problem includes solving 
an optimal stopping time problem which, even in the Black-Scholes, turns out to be 
far from straightforward. 


2 The Pricing Problem 


The definition of a CoCo requires the specification of its face value K and maturity 
T, along with the random time t at which the CoCo conversion may occur, and 
the prefixed price C, at which the investor may buy the shares if conversion takes 
place. We refer to t and C, as conversion time and conversion price, respectively. 
The quantity C, :— K/C, is refered to as conversion ratio. Assuming m coupons 
are attached to the CoCo, then we further need to specify a series of credit events 
that may trigger a coupon cancellation. Denote by r1, ..., Tm the random times at 
which the aforementioned credit events may occur. Then the whole coupon structure 
(cj, Tj, Tj) of the CoCo is defined, in such a way that the amount cj is paid at 
time 7;, provided the v; > 7;. We establish that the last coupon is paid at maturity 
time, i.e., Tm :— T. The coupon cancellation feature was introduced in [13] in order 
to alleviate the so-called death-spiral effect exhibited by the traditional CoCo—see 
details in Sect. 3.1. Thus it is assumed that coupon cancellation precedes conversion 
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according to T] > --- > Tm => T. Of course it suffices to set t; = --- = Tm = oo if 
this feature were to be excluded from modelling. 

It will be assumed that the issuer of the CoCo pays dividends according to a 
deterministic function «. From the investors side, it seems to be reasonable to assume 
that no dividends are paid after the conversion time t. Thus hereafter we shall assume 
that the following condition holds true. 


Condition (F). There are no dividends after the conversion time T. 


Remark 1 Itis worth mentioning that, whereas this condition simplifies the expres- 
sions obtained for the price, it is not crucial, in the sense that the computations still 
can be carried on. We refer to [13] for a further discussion on this topic. 


Once these features are settled, the CoCo's final payoff, is given by 


K fe rgd 
, hy au 
Klin g Srt) + S ce 7" lgh (5) 
p Z 
j=l 


where (S;);>o0 and (7;);>0 stand for the share price and interest rate, respectively. 


2.1 A Model-Free Formula for the CoCo Price 


In Proposition 2, a model-free formula for the CoCo price is given. Let us first 
introduce some notation required for what follows. Underlying to our market, we 
shall consider a complete probability space (2, F, P), endowed with a filtration 
F :— (Fi)rejo,r] representing the trader’s information—this includes the information 
generated by all state variables (e.g., share price, interest rates, total assets value,...) 
and the default-free market. All filtrations considered are assumed to satisfy the usual 
conditions of P-completeness and right-continuity. We shall denote the evolution of 
the money in the bank account by (B;);e[0,7), ie., 


t 
B= exp | [s]. O<t<T. 
0 


Recall that two probability measures on (2, F), P4 and P2, are said to be equiv- 
alent if, for every A € F, Pı(A) = 0 if and only if P2 (A) = 0. We shall assume 
the existence of a risk-neutral probability measure P*, equivalent to the real-world 
probability measure P, such that the discounted value of self-financing portfolios, 
(V, E 5 te[0, T]. follows a P*-martingale. Hereafter the symbol tilde will be used to 
denote discounted prices. Now, in addition to IP*, we shall consider other two prob- 
ability measures (also equivalent to P) which will allow us to carry on some of the 
computations related with the CoCo arbitrage-free price. First, letting (B(t, Tj)):>0 
stand for the price of the default-free zero-coupon bond with maturity 7j, we define 
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the T;-forward measure P% through its Radon-Nikodym derivative with respect to 
P* as given by 

T: 
GPT ^ e- do ndu 


dP* BT) ` " 


We say that PË is given by taking the bond price (B(t, Tj))1>0 as numéraire. 
Similarly, but now taking the issuer's share price (S;);-o—without dividends—as 
numéraire, we obtain the share measure po. its Radon-Nikodym derivative with 
respect to P* is given by 


dp)  e-Jolu-kG0Mus.. 


(7) 


dP* So 


In what follows, expectation with respect to P*, P4 and P®) will be denoted by E*, 
zT and E®), respectively. 


Proposition 2 The discounted CoCo arbitrage-free price, on the set {t < v), equals 


m 


Tı = E* [ K1c>7)| F] + C,E* [$r14 7) | F] + > E* [Slim | F] (8) 
j: Tj>t 
i T z C$. as) m = T; 5 
= KB(r, TIF (v > T| F,) + — POC <T|A)+ >. GBC, TP (Gy > Ti F). (9) 
el K(u)du 


Jj: Tj>t 


Proof Due to the Condition (F), to receive Es at time t is equivalent to receive 
p 


&Ssr at time T. Therefore the payoff in (5) is equivalent to 
p 


K 


K1 
{t>T} =P C, 


m T 
- ud 
Srltr<r) + S ceh É "lyst; (10) 
j=! 


and thus expression (8) for the price follows by preconditioning, taking into account 
that C, = on As for (9), it suffices to notice that, in light of the abstract Bayes rule, 


for every X e Y! (Zr, PŪ) we have 


pT; T; 
[Bla] «perl 
DIX] F] = = ; (11) 


" | dp | e^ Jo rudu Bf, T) 
dP* 


t 
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and thus 


T; 
a [xe Í J rudu 


z = B(t, TJET [X] FI. (12) 


Similarly, for every X € Z! (Fr, P®) we have 


Ona | Xsre" f rudu 


2 — e- JE edu ZOX] F. (13) 


Combining (8) with these identities we get the expression (9). 


With Proposition 2 at hand, the subsequent difference between models relies on 
how conversion and coupon cancellation is defined, and how the corresponding prices 
are evaluated. In this paper we follow a structural approach to price CoCos. That is 
to say, given a model for the share price (S;);+0, a series of critical time-varying 
barriers £ and £/ are set in such a way that £! >... > L” > £ and the credit events 
are given by 


t := inf(t > 0: S; < £(1)), and — rz:—inf(t > 0:S, < do}, jJ=l,..,m, (04) 


with the standard convention inf Ø := oo. Then our main concern is to derive analyt- 
ically tractable formulas for (8)-(9), either in the form of closed formulas or efficient 
simulation methods. Later on in Sects. 5 and 6 we shall incorporate short-term uncer- 
tainty and extension risk to the pricing problem. 


2.2 Pricing CoCos with Write-Down 


In the case of CoCos with write-down, upon the appearance of the trigger event the 
investor does not receive a certain amount of shares. Instead, she receives only a 
fraction R € (0, 1) of the original face value K, provided that the issuer has not 
defaulted. Let ó denote the random time at which the issuer may default. Then the 
payoff of this CoCo contract with write-down equals 


m T 
. rudu 
Kl. RKlcnlo.rn-c Sce" lij-rj- 
j=l 


Similarly to Proposition 2, we can give now a model-free price formula for the 
CoCo with write-down. For this matter, we make no further assumption beyond 
model consistency in the sense that ô and t are modelled in such a way that ô > c 
so that default may only occur after conversion. 
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Proposition 3 The discounted arbitrage-free price of the CoCo with write-down, 
on the set {t < v), equals 


TI .— (1 — R)KB(t, TP! (x > T| F) + RKB(t, TP! (8 > T| F) 
m 
ON. Bt, TP" (s; > T] Fo). (15) 


j: Tj>t 
Proof It suffices to notice that the payoff can be rewritten as 


R1 sss 1tr<s} =P lis = Rlis>s}lir<s} + Ls s s (1 = R)1fr>5}) 
= Rlis>s}lir<s} = R1 rss 1psss} F ad m RJ 
= Rlis>s} (1525 F lir>s)) F el = R)lir>s}, 


where for the second equivalence we have used the identity lir>s}lis>s} = Utrss}, 
which holds due to the consitency assumption t < ô. 


By comparing (9) with (15) we can see that the techniques used to price the CoCo 
with conversion can be readily applied to CoCo with write-down. Across this work 
we focus on the former contract. 


3 A Model with Stochastic Interest Rates 


In this section we assume that the price of default-free zero-coupon bonds are sto- 
chastic. More specifically, for j € (1, ..., m}, the default-free zero-coupon bond price 
(B(t, Tj) )te[0,T;] is assumed to have the following P*-dynamics 


d 
dB(t, T;) k 
— ——— = ndt + by (t, T))dW;, (16) 
BT) C 2. TUM 
where each Py is a positive deterministic càdlàg function, and (wi ous we )re[o,7] is a 


d-dimensional Brownian motion with respect to the risk-neutral probability measure 
IP* and the trader's filtration IF. We shall assume as well that the share price (S;);<[0,7} 
obeys 


d 
ds ; 
— odd M ox (taf, (17) 
S; 
k=1 
where o := (ox)4 is a positive deterministic cadlag function such that, for all t € 


[0, 7;], the inequality ||o(t) — b(t, T;)|| > 0 is satisfied. 
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The conversion and coupon cancellation events in our model are linked to the 


asset dividends and the evolution of bond prices, in such a way that conversion is 
triggered as soon as (S;);>9 crosses 


Tm 
Lr := LmB(t, Tm) exp n Kanan] > O<t< Tn; 
t 


and similarly, for j € (1, ..., m}, the j-th coupon cancellation is triggered as soon as 
(St)r>0 crosses 


L;Bt, Tj) exp uh «(udu Nc T 
Mj, t= Tj. 


g:— (18) 


The parameters M; and L; are assumed to be given non-negative constants satisfying 
Mj = Lj, with Lm < Cp and 


Li Tu 
—— € exp4— k(u)dur, j=1,...,.m—1, (19) 
Lj T; 
so that the required ordering 4! > & > ... > (" is fulfilled—thus ensuring that 


0 x 7 < Tj implies tj < 141, forj = 1, ..., m — 1. Clearly the Mertonian condition 
(.e., Sr, must be bigger than Mj) can be removed by taking L; = Mj. See Fig. 1 for 
an illustration of the barrier's shape and parameters. 


To T T, T; ene Tn 


Fig. 1 The graph illustrates the share price (5;);-9, along with the barriers @ and its parameters 
Lj and Mj. The first barrier is hit at t = Tı, whereas the third one is hit at some 7? < t < T3. On 
the other hand, the second barrier is not hit since the share price stays above £? on the whole period 
[To. T2], and the Mertonian condition is satisfied, i.e., Sr, > M». Conversion is not triggered either 
since the barrier £ is never hit by (S;);>0 
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In the current setting, the process (ul := log d) 10 plays a fundamental role. 
Indeed, from the definition of the random times r and r; (see (14)) and the barriers 
£ and # it follows that 


O<t<Tm 


{t = Inm = | inf U" > o and (rj > Tj) 
=| int, mt, v >0, vi, > toe AI J= l;o t: 
O<t< 


From this observation we have that, in order to price the CoCo contract, we need to 
be able to compute the conditional joint distribution of (Ur, j is Ur ), where we have 


defined ui T, = info<r<7; Ul. To this matter, notice that an application of the Itó 
Em <t< 
formula tells us that 


; 1 f 
dU! = "S lo) — b(t, Tj) |’ dt + |o t) — bt, T) | dWP. 


where (wP) t>0 is the P4i-Brownian motion given by the Girsanov theorem, corre- 
sponding to the probability change (6). In fact we can see that under IP)? we have 
similar dynamics 


"E 
du! = 5 Jo) — bt, 7) |^ dt + |o «0 — bc, T) | awe, 


where cw) 1-0 is the P)-Brownian motion corresponding now to the probability 
change (7). Consequently, a time change given by 


t 
aj(t) =f lo(s) -= bís, T° ds, O<t <7, (20) 


renders the fundamental process (U7 :— log Jp) a drifted Brownian motion. 
Then we can apply a known result (see [34]) on joint distribution of the Brownian 
motion and its running infimum in order to obtain the following closed-formula for 
the CoCo price. See details in [13]. 


Proposition 4 /n the current setting, the CoCo arbitrage-free price, on the set {t < 
Tm}, is given by 


ñ ; Sie à k (u)du 
Ti = b3 1;-2)cj | BE, T) (Cd — Dj) L; 
j: Tj>t / 


d —D)| QD 


K m 
+ KB(t, Tm) + (c. = 5) (La. Tn) ® (dP) + Se JP «dug (qm) , 


qm 
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Tj 
LB, Tel KO 
og J————— —— 


Jal s tah ow -bu Tpha 
+= , an 
V [P |o@ — blu, TP du 
1 Mj 
D; = log (22) 


VIP jouw -bu T [du 4 


and ® denotes the standard Gaussian cumulative distribution function. 


3.1 The Black-Scholes Model and the Greeks 


The Black-Scholes model is obtained as a particular case of the model above, by 
taking default-free bond prices with null volatilities—i.e., by taking the b(-, Tj) in 
(16) to be zero. Consequently, the closed-form price formula given by 
Proposition 4 can be used in order to derive the Greeks, Delta A and Vega v, which 
respectively describe the CoCo’s price sensitivity to share price and volatility. We 
have the following. 


Proposition 5 Let © and ¢ denote, respectively, the standard Gaussian cumula- 


tive distribution and density function. In the Black-Scholes model, the CoCo’s price 


sensitivity to share price A := = 5 and to volatility v :— = ot are respectively given by 


2 : 
A= Wren 7 pb) eu.) 
PE i AE VTj-t 


K 2 m m 
«(z c) (= own). 


and 
c; log g $ -r(Tj-1) : . : 
Tou" E m (Le + sed!) 
j: Tj>t L Tj B ( 
log =" =r(T — t) 
G (P(N) Le + s e c) , 
z) o Tj mn t 
where 


E 
jy E 50°)(Tj —t) 


+ o/Tj—t , J D 


,m. (23) 
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Some remarks are in order. On the one hand, it is has been documented that by 
actively hedging the equity risk, investors can unintentionally force the conversion 
by making the share price deteriorate and eventually trigger the conversion. This 
situation is referred to as the death-spiral effect. Now, from the above expression for 
the Delta A it can be checked that it is strictly positive, and in fact one can observe 
that the Delta A increases sharply when the time to maturity T decreases. Thus one 
of the conclusions in [13] is that the coupon cancellation feature leads to a flatter 
behaviour of the Delta A, hence reducing the death-spiral risk. On the other hand, it 
can also be checked that the Vega v is strictly negative, this tell us that an increase in 
the volatility translates into a decrease in the prices. Such observation is clearly in 
line with the intuition that a higher volatility will increase the probability of crossing 
the barriers £ and ¢/ defining the conversion and coupon cancellation events. 


4 Advanced Models 


4.1 Incorporating the Heston Stochastic Volatility Model 


Let us start this section by remarking the fact that the arguments preceding the obtain- 
ment of Proposition 4 hold even if the share and default-free bond price volatilities 
(i.e., o and b(-, Tj) in (17) and (16)) are no longer deterministic. However, the time 
change aj in (20) would be now stochastic and, consequently, the time-changed 
fundamental process dynamics 


aUi = —À ES dW/i, Q4) 
would no longer match those of a drifted Brownian motion. Thus one anticipates that 
the pricing problem, in the setting of stochastic volatility, will lead to closed-form 
formulas only in few cases, and require more advanced numerical tools otherwise. 

As a particular model, we shall assume that the volatilities are stochastic according 
to the work of [26]: we consider a new stochastic factor (V;);>0 acting on both o and 
b(-, T;), in such a way that the dynamics in (17) and (16) are now replaced by 


d 
dS 
A. - [r, — «(dt + V/V, > ok (AWE, (25) 
f k=1 
and 
dB(t, T;) 4 
3]. k 
— dps] > .(£, Tod 2 
Bü. Tj) redt + y Vi by(t, Tp)dW;, (26) 


k=1 
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respectively. The factor (V;);>+0 is given as the solution to the following SDE 


dV, = [æ — BV;ldt + y / V;dZ;, 


where a, P and y are constants, with 2o > y? to ensure the positivity of the solution, 
and (Z;)+>0 is a one-dimensional P4i-Brownian motion. Similar dynamics under the 
share measure IP?) are assumed to be satisfied by (V)r>0. 

If we further assume the independence between the noises driving the prices and 
their volatilities, then we see that the P/-Brownian motion in (24) is independent of 
the (now stochastic) time change 


t 
aj(t) =f V; |o (s) — b6s, T)) | ds, O<t<Tj. (27) 
0 
Thus, by a preconditioning argument, we obtain the following extension of Proposi- 


tion 4. 


Proposition 6 In the current setting, the CoCo arbitrage-free price, on the set {t < 
Tm}, is given by 


m 
m= M deg | Be TEV (d — DIA 
j Tj>t 


SETI (d! — DIF: 
Del k(u)du 


KS. 
— Emio (dh F] 


+ KB(t, Tm)E™ [6 (-d")| Fr] — 
Led 7 «(u)du 


Tm 
+ CS1 fi" COURO Lp (qr)! F + CrLmBlt, TEPID (47)] Fil, 


where 
LjB(t teli Kos ap 
eg ae IO a (28) 
+ JE Vs lo (s) — b(s, T;) E ds 
and 
1 M; 
Dj = " " 


VI v. o) = bis, To [as 


From the pricing formula above we can see that the CoCo price is related to the 
price of binary options. Indeed, for instance, for a binary option with maturity 7; and 
strike Mj we have 
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F] 


T; 
p* 7 rid 
|: fer t "ls; <M) 


5| = B(t, T)E” | isy <m) 


= B(t, TP” (vi slez 
J 


Moreover, according to our previous discussion, conditioned to FY :=0 (V, 0< 
J 


s < Tj) the random variable U}, is normally distributed, so that we also have 
J 


T; j j 
P (vi < log Ln 


2 = EU [6 (d + DIF]. 
J 


Let us now give an explicit computation of the probability above; this illustrates how 
the CoCo price can be computed. For this matter, we use the relationship between 
the characteristic and distribution functions (see for instance [38]) which allows us 
to write 


p^ (v, < log — 
J Lj 


1 1 
Fr = -+ >— 


M; i Ts . Mi aig J . 
n (4) g(t, Ur Vii -8) - (Z) Q (t, Ur, Vi; E) 
0 ig 


dé, 


where 


9 (t, Un Vj; £) := EP [exptiEU))].,] = E” [exp{iE(U})} Ur, V,, — G0) 


and the last equation holds by Markovianity. Hence the problem of computing 
the probability above translates into the problem of finding an expression for 
g7 (t, u, v; £). It follows from the Itó formula that 


ag  18?gh , 18g ,  18gh , agli 
vor, + y^v vor. 
ot 2 du? j 209v 2 ðu J Qv 


(a — Bv) 20, Gl) 


with the boundary condition g” (Tj, u, v; E) = e$“ and where o2 = ||o —b(., T). 


T; 
For an affine solution like 
p(t, u, v; E) = ePi] BT vitu, (32) 
the PDE in (31) is reduced to the Riccati equation 
8B; | 2p? + BB; = — (18? — i&)o2 
g 2Y 5j == 25) OT, 33 
3A; (33) 
a = Bj, 
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with A; and B; vanishing at t = Tj. As shown in [13], for a constant function oj, R 
such equation is explicitly solved by 


AG) B  exp[-40;-0]- 1 


BT) — t) = 
iT; — 8) y? exp { MT; t)} + nets 
and 
Aj( = ed eie [ Flos (AE) = exo [AET - 0) X955) 


HE, - p]. 


where A(£) :— "S + yor, (€2 — i£), and ,/- denote the analytic extension of the 


real square root to C V R_. 


4.2 An Exponential Lévy Model 


In this section we shall consider an exponential Lévy model for the share price. As 
opposed to the previous sections, we shall now consider a numerical approach to 
pricing, based on exploiting the so-called Wiener-Hopf factorization of the driving 
Lévy process (X;);+09. This approach has been recently applied in order to price 
contracts with path-dependent payoffs as in [12, 31]; see more details below. 


4.2.1 First-Passage Times and Wiener-Hopf Factorization 

Let (X;);>0 be a Lévy process with characteristic triplet (u, o, v), and denote its 
characteristic exponent by wy. For details and proofs of the following arguments we 
refer to [1]. 


Recall that if e(4) is an exponential random variable with parameter A, independent 
of (X;)+>0, then we have the following equality in distribution 


Xea) — 1-4 S, 
where J and S are independent random variables, distributed as 


X = inf X, and X sup X, 
Ael) O<uce(d) u e(A) Ram u 


respectively. Moreover, 


: [exp (2X«05)] = E [exp {2Xoay}] E [exp X205] - (34) 
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We shall refer to (34) as the Wiener-Hopf factorization. In fact, in general it holds 
that 


Xr 
2 [exp [2609] =~: 


Consequently, the knowledge of one of the factors in (34) allows us to establish the 
other one. A particular case of interest arises when (X;);>0 is a spectrally negative 
process since, in this case, it is known that the right factor in (34) is given by 


; . v By 
vy (2) := E [exp {zX-ay}] = rm (35) 
where f; is a constant, depending on A, defined as the solution to 
Vx(p) =A. (36) 


Therefore, once we have computed $, explicitly, we obtain the following expres- 
sion for the left factor in (34): 


A B.-—z 
A—wx() B 


V; © = E [exp (25,5))] = (37) 


This expression can be linked to the distribution function of X, by partial integration. 
Indeed we have 


oo 0 
yo f Xe tdt J ze P(X, > £)dé 
0 —oo 


=z n B f D e ^5 F(t, E)dtdé (38) 
0 0 


= AzF(A, z) 


where we have defined F(t, £) :— P(X, > —&), and denoted its Laplace transform 
by F. As argued by [33], by combining (37) and (38) we can recover F by the standard 
Fourier transform inversion. Further, the result can be numerically computed in an 
efficient way, provided the condition 


M aP 
lim - [e* -1l- zxlgx<1) | v(dx) = 0, (39) 


200 Z J_oo 


holds true. This condition is imposed in order to facilitate the computation of B; (i.e., 
the solution of (35)) by means of suitable integration contour change. In fact, such a 
change allows to take 8; = 4/2. This result is summarized in the following lemma. 


196 J.M. Corcuera and A. Valdivia 


Lemma 7 For fixed t and &, and given the parameter set (A1, A», l1, l2, Ni, N2), 
define 


Ai Ex. uim. Aper eee 
üj]:— ——.y d5»i— SS i= —, =, x)= X 
lu lcm Sm cad E EI 
and, for every N € N, 
N N 
, hihs dg " 
sy (t, E) = ym > 2 ac Pa tinm) 
n=—N k=—N 


F(g(ai + inh), a2 + ikh;)e'£ ut tinh )+x(a2+ikhy) | 


If the condition (39) is satisfied, then the following approximation holds true 


No 


N. 
PX, -52» ( 2 Jost. £), 


n=0 
where the symbol = indicates an Euler summation. 


The double sum in the lemma is used as an initial approximation of F. Here the 
parameters (A, A», l1, l2) are positive real numbers chosen large enough in order to 
control the aliasing error. The final Euler summation is used in order to improve the 
accuracy of the raw approximation sy. It is suggested that choosing Ay = A» = 22, 
l = h =1,N, = 12 and M = 15 gives satisfactory results. For further details see 
[33] and references therein. 


4.2.2 The One-Sided CGMY Lévy Process 


Hereafter we shall focus on a particular spectrally negative Lévy process (X;);-0 
known as one-sided CGMY process, or simply CMY process. This process has no 
continuous part, and only one-sided jumps with its Lévy measure being given by 


v(dx) = C exp(—Mx)|x| !  120jdx, (40) 


where C, M > 0 and Y < | are constants. Further, its characteristic exponent can 
be obtained in closed-form as 


0 
Wx(z) = uz «f [e* — 1— zd qq] vx) 


—o0o0 


= uz + CI'(-Y)(M + z)! — M”), (41) 
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where I” stands for the Gamma function. Thus it is apparent that the condition (39) 
holds true. Before we proceed, let us mention that the CGMY processes are also 
referred to as Tempered stable processes. On the other hand, by setting the parameter 
Y — 0 (resp., Y — 1/2) the CMY process becomes a Gamma process (resp., Inverse 
Gaussian process). For details on CMY processes we refer to [4, 31] and [36, Section 
2.3.5]. 

In order to price CoCos we need to understand the behavior of (X;);>0 both under 
the risk-neutral measure P* and the share measure P“). The following result shows 
how the Lévy characteristics of (X;);>9 change under Esscher transforms. 


Lemma 8 For every real number 0, consider the probability measure P" , equivalent 
to P, given by 
dip* _ expíoXr] 


dP i [exp (oXr]] l 


Assume Ma := M — a > 0. Then the Lévy exponent of (X;)i»o under P® is given by 
0 
Vy (2) :— ZH «f [e* — 1 — zxdqg421] vo (d), 


—oo 


where 
Ua =U «f x(&* — 1)v(dx), and — vy (dx) := C exp{—Mgx}|x|7!~" 14 oy de. 
Ix[x1 


Proof The first part follows from (41) and [35, Theorems 33.1 and 33.2]. Now, using 
the expression in (40) we have 


vo (dx) = Cexp(- (M — a)x]|x| ! " 1jzojdx = C exp{—Mex} x7! 1p. code. 


The assumption M, > O assures that (C, My, Y) is a rightful parameter set for a 
CMY distribution. 


4.2.3 Application to CoCos 


We shall assume that, under P*, there is a pure-jump (C, M, Y)-Lévy process (X;);>0 
driving the share price (S;);>0 in such a way that 


exp {X;} 


.— a(r—k)t 
Ser” BF [exp X] 


= exp{ut + Xj), t> 0, 


where the interest rate r and the dividends « are assumed to be constants, and we 
define w := — log z*[e*! | and set H := r—Kk +w. Further, we shall assume that the 


198 J.M. Corcuera and A. Valdivia 


parameter M is bigger than 1. We remark here that, on the one hand, this technical 
assumption will allow us to accommodate the Variance Gamma (VG) process con- 
sidered in [30]. On the other hand, this assumption is consistent with the numerical 
experiments reported by [31]. 


Proposition 9 In the current setting, the price of a CoCo at time 0 < t < T can be 
numerically approximated in an efficient way by means of the expression 


m 
me > eU PLI Ke ts pO Tm) + oe On pL GT. 
ji Tj>t 
where 


No 
" ONO(N2Y S ; ; 
P «ms D 2). (1) -tlog i +o(T}-9), f= lem, a =0,1, 
n=0 
(42) 
with the symbol = indicating an Euler summation, and 
N N _1,Y-l 
? Ha + YCF'(-Y) (Ma + uz!) 
S5: >) > PA 
n=—N k=—N didici 


Fa (ga (a1 + inh), a2 + khpe anaha, 


with the parameters (ai, a2, hi, h2, ly, Do, Ni, N2) given as in Lemma 7, and 


Fy(a,2) :— c 
TUUS (= aa = CP (HY) (Ma + 2! = MD) 22 


—1 : Y 
£400 = x + CP (-Y) (Ma + n x) -Ml). a=0,1, 


where My and [Ly are defined in Lemma 6. 


Proof Taking into account the general expression for the CoCo price (c.f. Proposition 
2), computing zr; boils down to compute 


Pr, Tj) = P* (v; > T| F) = P(X, > | i 
j € T9 (aena Gum c E=log(S;/€i(t)) +o (T-t) 


forj = 1, ..., m, and 


Pl (t, Tm) = P (tn > Tm| Fz) 2 PO (x -— | ; 
nt m) ( m ml t) XT,- £) £—log(S, /£" (+0 (Tm—t) 

These computations can be carried out by means of Lemma 7. Indeed, under this 

exponential Lévy model, the share measure IP(9) (resp., risk-neutral measure P*) 
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coincides with an Esscher transform of parameter a = 1 (resp., a = 0). In light 
of Lemma 8, the driving noise (X;);>09 will remain a CMY process under PO but 
now having the shifted parameter set (C, Mj, Y) = (C, M — 1, Y). Moreover, this 
implies that the Lévy measure of (X;);+0 also satisfies the condition (39) under po. 
Thus, if we write P? = P* and IP! = IP), by the reasoning in Sect. 4.2.1 we see that 
the Laplace transform of Fy (t, E) := P” (X, > —&) is given by Fy as defined above. 
Moreover, the correspondent contour change is given by gy. 


Remark 10 [12] provides an alternative approach which exploits the Wiener-Hopf 
factorization in a different way: instead of computing first-time passage probabil- 
ities as done here, what is computed is the joint density of (X;, X,). As the noise 
driving share prices, the authors consider the so-called Beta-Variance Gamma (B- 
VG) process—also referred to as Lamperti-Stable process by [3]—which exhibits the 
same exponential decay as the Variance Gamma process, hence leading to a smile- 
conform model. For this B-VG process the distribution of the variables X4(;) and 
Xeo) can be specified, thus obtaining the Wiener-Hopf factors wy and y, . Taking 
(8) into account, combining the knowledge of the (X;, X,) density with a Monte- 
Carlo technique due to [29], the authors provide an efficient numerical pricing of 
CoCos. 


5 Triggering Conversion Under Short-Term Uncertainty 


Linking credit events to the movements of a fully observable (i.e., F-adapted) process 
(U;)1>0 is certainly one of the most appealing features of structural models. Indeed, 
this full observability assumption—hereafter referred to as (A1)— gives rise to clear 
and analytically tractable models as we have seen in the previous sections. When 
considering contingent capital contracts such as CoCos, however, the assumption 
(A1) seems arguable since in most cases regulatory capital depends on the balance 
sheets of the issuer, and those sheets are updated only at a series of predetermined 
dates (t;)ren. Thus we are interested in considering the following partial observability 
assumption. 


Assumption (Al'). The fundamental process (U;);-o is fully observable only at 
predetermined dates (1j);eN. 


On the other hand, when the process (U;);>0 is related to the share price, it is 
also commonly assumed that the correlation between the noises driving the share 
price and (U;);>0 is equal to p = 1 (or p = —1)—hereafter this assumption will be 
referred to as (A2). Nevertheless, it would be reasonable to consider the chance that a 
different (possibly time-dependent) correlation parameter p € [—1, 1] may provide 
a better fit. Consequently we are also interested in considering the correlation p as 
an additional rightful model parameter by taking the following alternative to (A2). 


Assumption (A2’). The correlation p between the noises driving the share price and 
(U;);-9 may vary. 
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In what follows we revisit the simple framework of Sect. 3.1 in order to illustrate 
these ideas and show how the pricing problem is modified under (A1’) and (A2’). For 


a full study of this short-term uncertainty model we refer to the forthcoming paper 
[15]. 


5.1 Pricing CoCos on a Black-Scholes Model Under 
Short-term Uncertainty 


As shown in Sect. 3.1, in the Black-Scholes model, 
dS, = S,([r — «]dt + odW;», 


the cancellation of the j-th coupon is triggered as soon as the process 


dites dog ety 
t= 87 =I dt -- cdW;, 
t 


A Mi. , . 
crosses the critical value log Io j = 1,...,m, whereas for conversion zero is the 
J 


critical level. In this setting Assumption (A2) is translated as the correlation structure 
between the noise driving (S;);>09 and that of the new process 


1 1 
dU;(p) := -59 dt +odW? := 59 dt ^ cd(oW? + v1- p?Z), (43) 


where p is the given correlation parameter, and (Z;);>0 is a second Brownian motion, 
independent of (W7),- 9. Thus, instead of the process (U;);>0 above, we shall now 
consider the parametric family (U;(p));-o whose driving noise (Ww? )i»o is also a 
Brownian motion but correlated to (W;*);>0, in such a way that dw? dW; = pdt. 
Further, the time at which the j-th coupon may be cancelled is given by 


Mj 
T(p) :— inf 4f z 0 : Up) x log Ll 
j 


As for Assumption (A1’), notice that the full information flow corresponds to 
G, = 0o (W*, Zs, 0< s < t) = FV v FZ,  t-0, 


whereas, setting |t] := min(tj € {0, t1, t2,...} : tj € t « tj+1}, the information 
available to the modeller is now given by 


Fy = FN veZ0sssip-F" vs 120. 
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Since Y 2 .Z,, and the equality holds only at the predetermined dates (tj, j € N}, 
we can think of (Z;);>0 as an extra source of noise which clears out at update times 
(t;)jen- The fact that the extra noise is cleared out at (rj);cew motivates the notion of 
short-term uncertainty, and it has two important implications. On the one hand, our 
model differs from other partial or incomplete information models like [9, 10, 21], or 
[25] since the information structure is different. On the other hand, as opposed to other 
structural models, the short-term uncertainty considered here prevents the conversion 
time z; (p) from being a stopping time with respect to the reference filtration IF, which 
is generated by the relevant state variables and the risk-free market. Hence, one can 
investigate conditions under which rj(o) admits an intensity, as done by [9, 21], or 
[27]. 


5.2 Coupon Cancellation Probabilities Under Short-Term 
Uncertainty 


Let us show how the coupon cancellation probabilities are modified under the 
assumptions (A1’) and (A2’). We begin by defining two auxiliary processes 


S, £ 
Gi oV1-pZ Zu) ad &:cplg.--plg;—, 120. 
Lt] Lt] 


These processes have an important role in the computations within our short-term 
uncertainty model since they appear implicitly in (U;(p));>0 according to the fac- 
torization 


U;(p) = (Uis (p) + &) T (44) 


It is apparent that the term between parentheses belongs to F, = FW v F f p On 
the other hand, 7; is independent of Fry, and it is normally distributed with zero mean 


and variance 
v(t) :2 (= e^ — Lpo?. 


We note here that the variance v? (f) represents a key quantity within this frame- 
work. Indeed, on the one hand, it actually encodes the two new features of our 
model: the factor 1 — p? measures how close (S;);+o9 and (U;);«0 are to being com- 
pletely correlated; whereas the factor t — |t] measures the elapsed time from the 
last information update. On the other hand, as the following result suggests, coupon 
cancellation probabilities and other analytical formulas obtained within our model 
depend explicitly on v(t). 
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Proposition 11 For every x € R define the random time t (p) := inf (s > 0 : Us 
(o) x x}. Then, for every 0 < t < T, the following equation holds true on (x > |t]} 


? cr —(U p Sit lt} 
p* Tx(p) > TIF) = T [0] D_+ ) e (Ui (o0)—3) ( 
( x( ) | 1) | ( — JT $46 


[9 3] 


where the expectations above are restricted to the values 


Ox - Ung) = & € 50*(T — 1) 


+ aT (45) 


Moreover, 


p* |? (-»- E E EJ = f Ó (-»- + =) $ (z)dz (46) 
ONT —t R owT —t 


and 


. bt mo zw(t) — v? (t) 
nete (p. - 2 —]] =e! [ o (n. - 29-99) cias 
| y oT —t R * oT —t $0 
(47) 
where @ and ¢ stand, respectively, for the standard Gaussian cumulative distribution 
and density functions. 


Proof Since under 4, the computation is known, for every t = t; we have, on 
(o) > t}, 


P*(tx(p) > TIF) = P*(tx(p) > TI) 


1.2 . 
o (= + Ulo) -FsaQq e 2) e (0079 p 


o/T-t 
x — Uy (9) + 30° — t) 
o/T-t 


x —U,(p) + 1o?(T — t) 
Tr : 


Define 


di (t) = 
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For t ¢ (tj, j € N}, by conditioning we get, on {t,(o) > Lt]}, 


P*(n(p) > TIF) = E' [P'((9) > TIG| Z] =E*[ e (-4^ €) 2] 


= [e (U: (p) p (a.))| 2] 


In these terms, for |t] < t, the first summand above satisfies 


"— zy cue NS SS 
2 [6 d- M) F] = [e( pu) 


where the right-hand side of the expectation above is restricted to the value of D_ 
given in (45); in fact the equation above reduces to (46) since £; ~ N(O, v? (t)) for 
every fixed t > 0. Similarly for the second summand, it follows from (44) that 


HEE = x - = a ot 
E* E (U;(9)—3) gy (a1)| 2] = euo) [e üg (». = ) 
oV/T -t 


= e (Uu (are ) E* [ewe (>. = ĉr ) 2 
Sili oT —t 


In order to obtain (47), let us consider the change of measure given by 


Tm 


1 T 

ux = exp E = vo = exp n c V1 = p^ a )dZ, 
] fT 

pi [e V1 — pua oras 


aja 


In virtue of the Girsanov theorem, the process 
S 
Z= -f oyl — Pliu gadu, s> 0, 
0 


follows a IP -Brownian motion. Thus 


Tet nce lata 

E o (D+ - ——; Ey |® (D+ - a= 

ala fo oA — ppp az, + vq) 
u oȚyT —t 


fo 2i x4?(n ev[-z5] 
R o4T —t]  J2xv?() 


vz + v(t) 
= &{ Dy — —————— d 
n ( a eT Jo z; 


p 
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where the first equivalence follows from the abstract Bayes’ rule, and for the last 
equivalence we have simply used the standard change of variables z = OE 

Jeanblanc and Valchev [27] study the role of information on defaultable-bond 
prices within a Black-Scholes setting, with constant parameters and flat default bar- 
rier. In some sense, the short-term uncertainty model considered here can be seen 
as a bivariate extension of [27]. Despite this analogy, immediate differences arise. 
For instance, the survival probabilities obtained by [27] are constant between obser- 
vation dates, i.e., within each interval [tj, 441). Whereas in Proposition 11 these 
probabilities vary in continuous time. This difference relies on the fact that, event 
though within each interval [rj tj+1) our knowledge of the short-term noise 7; is 
constant, we still fully observe the evolution of (S;);>0 and all the other F-adapted 
state variables. 

In order to conclude this section, let us recall that in light of our discussion 
in Sect.3.1, CoCo prices can be obtained in our current setting once we compute 
expressions of the form 


P*(g(p)- Tj, Sr, LZ), and PO (n(o)- Tm, Sr, > Ll F1). 
It is worth noticing that the Z,-conditional joint distribution of (rj(o), ST) = 
(inf s<T; Us(p), ST,) cannot be computed directly from Proposition 11 since the 
entries of this vector are driven by two different (though correlated) Brownian 
motions. An additional complication comes from the fact that the current infor- 


mation about one of them might be incomplete. The aforementioned distribution, 
and full details on the model, can be found in [15]. 


6 Extension Risk 


According to the new regulatory Basel III framework, CoCos can be categorised as 
either belonging to the Additional Tier 1 or Tier 2 capital category. In order to belong 
to the former class, a CoCo is supposed to have the coupon cancellation feature and, 
further, no fixed maturity is to be imposed to the contract. Instead, the issuer is entitled 
to redeem the CoCo at any of the prespecified call times (T;, i € N}. Moreover, as 
opposed to the common practice on callable contracts before the 2008 financial crisis, 
the definition of this contract does not contain any incentive (e.g., a coupon step-up) 
for the issuer to redeem at the first call date. Investing in such a contract has the 
inherent risk of a financial loss due to the lengthening of the (investor's) expected 
maturity duration which ultimately postpones the payment of the face value K. This 
risk is referred to as extension risk. Two recent papers [14, 18] have addressed the 
problem of pricing CoCos belonging to the Additional Tier 1 capital category. As an 
illustration, let us revisit the Black-Scholes model in Sect. 3.1. 

In order to emphasize the correspondence with call times, we add now an extra 
index i € N to the coupon structure, in such a way that a coupon cj; will be paid 
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at T; provided t; > Tj. It will be assumed that for every i € N the ordering 
Tj-1 < Ti, ... € Tim :— T; holds, where we set To := 0. For the sake of clarity, let 
us remark that in the current setting, the barriers in (18) and their parameters become 


a :— FM O<t<Ty 

M, ij t= Tj. 

From the issuer's point of view, the question of whether to postpone or not the 

face value K payment depends on which alternative is cheaper. Hence, similarly to 

the situation of Bermuda options (see for instance [37]), the discounted price of a 
CoCo belonging to Additional Tier 1 capital category equals 


" inf E* Es Fr, |, t=T, e (Tj, €N} (a) 
f: em (48) 
2* [zm | Fr] » tE (Th, Tho); (b) 


where %, stands for the set of stopping times taking values in {7;, i > n} and 


LTj—0 m K 
Ze = >» 2 Ole jor» Mj) + Ke "915.0 + ct Suse. 
i=n+1 j=1 P 


It is important to remark that even though the general optimal stopping theory allows 
us to characterize the solution to the optimization problem in (48a), this is not enough 
to tackle whole pricing problem. Indeed, the solution to (48a) must be obtained in 
a relatively explicit way in order to be able to give a reasonable expression for the 
price in-between call dates (48b). Here we shall address the finite horizon case, i.e., 
the case where there are only finitely many call dates (71, ..., Ty} and Ty < oo. 

In this case, for every fixed T,, the solution to the optimization problem in (7.6.1) 


gessi V goose Ee Pe VV NAL EET ss 


given by 


k m 
= m B K ~ " 
x = 2 = Ke "T1 sr) + c* Selist + > > Cijlizj» Tj, 5r; - Mj 
P. i=n+1 j=1 


xo z 
POJN w k=N (49) 
k 7 : va n 

min [ZO E [Y |. 25 ]]. &2 - 1... 
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As it turns out, the lower Snell envelope qm ke(n,...,N) can be obtained in a rather 
explicit form. Indeed, for the first iteration of (49), if t > T, then what we have is 
the raw expression 


Fy? = min [332 E [XP Fra} 
K ius: m 
= min {ker lir>Ty-1} + Go Sellest- F 5 S aede Srj>Mij}> 


i=n+1 j=1 


7 K m 
er TNA rs] + ct Suesni T > S čil {tj>Tij . S15 2 Mij) 
P i=n+1 j=1 


Fry- J} 
= m 


G Er KS fv<Ty_i1} + » DAM ey>1).57, >My) (50) 


i=n+1 j=1 


Tdg.r min peo ; 


m 
T; 
B*| Ke 2 NI {t>Ty} ptt ce KS {t<Ty} p+ Do tnl [xwj 7 TNj. Sry; >Mnj} Jw |] 
p j=l 

K N-1 m 
eee KTS 1 GA 1 inl Ke T- zx 
= C e T [cxTya) + Cij [rj Tij. Sr; 2 Mij) + Lies7y_,} min e >UTy_1 [> 

p i=n+1 j=l 


where as in the previous section, 77,_, denotes the price of CoCo here with maturity 
Ty and coupon structure (cj, Tj, Tj). '..,. Due to the share price Markovianity, the 
price 77,_, can be seen as function of Sty_,; we denote this function simply by 
Try (x). Now, as discussed in Sect.3.1, the CoCo has a positive Delta, thus the 
function Z7, , (x) is increasing and we can find a value Sy , such that 


S5 p: inf E -0:5y0)2 gets. 


Hence we obtain 


— m 


voa (€ ekt 
Yy-1 = G S (r<Ty-1} + > 9X Uu» Tij Sr; 2 Mi) 
i=n+1 j=1 
—rTy_1 ES 
+ Ke Uesty_1,Sry_,=9%_4} + Try Ur>Ty_1,Sty_, ESO 


N-l m 


K 
= —rTy-1 KT 
= Ke lir>Ty- 1-STy—1 ZSN-1} pt © e Fa" (t<Ty_1} + > > Cil [rj Tij Sr; 2 Mij) (51) 
i=n+1 j=1 


K EE 
* rTN KT 
+E [ke NI {t>TN,STy_1 «SN 1l pE Cy e Sr lsr, «Sy p TN-1<1<Ty} 


m 

n g. 
» CNj Yir; Tj Sry » Myj.Sry. i a |a]. 
j=l 


So far we can see that basically all indicator functions appearing originally in (50), 
has been augmented in (51) by an additional condition on Sr, , (i.e, Sty_, < Sy 1 
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Oor$py, , = Sy_ 1). In [14] it has been shown that the computation of the Snell envelop 


Recor 


it can be seen that the payoff of a CoCo having the callability feature can be written 
in terms of 


where [t] is the element of (Ti, ..., Ty} such that [t] — 1 < t < frt], and the 
additional variables ST, ..., S , are defined by analogous reasoning to that behind 
the obtainment of Sy ,. Here Sj :— oo and Sy := 0 are set by convention. 


Proposition 12 Jf the CoCo with extension risk is active and conversion has not 
occurred, then its discounted arbitrage-free price is given by 


Hh? = = Tup” (si > Tij, ST) < SQ. ope < Sioa, ST; > Mij, |n) 
Lj: Tjt 
+ > KP (t > Ti Sry < S$ o ST < Sio Sr; > St| F) (52) 
i: Tj>t 


N 
K rn 
+> a URDU (t < Ti, Snc cep AP eS anms). 
i=l 


Proof With the explicit description of the payoff corresponding to the CoCo with 
extension risk, the result is obtained as in Proposition 2, here taking into account the 
following identity 


Ir < Ty, Sy, < SQ; es SprJ-1 < Sta] 


= Wt x Ti Sn < $5, S4 < S14 Sr, = ST]. 


In view of this proposition, the obtainment of a closed-form formula for the price 
CoCo with extension risk requires the knowledge of the conditional distribution 
of (T, $1, S7,,..., $75) for i = 1,..., N. In the Black-Scholes model, this can be 
achieved by means of the following general lemma obtained in [14]. 


Lemma 13 Let (B;);>0 be a Brownian motion with drift u and volatility o, and 
denote by x its first-passage time to level zero. Then, for arbitrary instants T} < 

- < T, and arbitrary non-negative constants a4, ..., ay, on (x > t} the following 
equation holds true 
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P(t = Tn, Br, < 4... Br,_, < In—-1, Br, > an| Fi) 
=P (—a < Br, < dj, —~an—1 < Bp, < an-1, Br, > an| 4) 


—2uo-!B n 5 n 
—e ^M? PiP (—a; < Br, «aj... —an-1 < Br, < ag 4, Br, < —an| Fr), 


where Br, = Br, — 2u(Tj — t), j = 1,...,n, and (F;1)1>0 stands for the natural 
filtration generated by (B;);>0.- 
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Quantification of Model Risk in Quadratic 
Hedging in Finance 


Catherine Daveloose, Asma Khedher and Michéle Vanmaele 


Abstract In this paper the effect of the choice of the model on partial hedging in 
incomplete markets in finance is estimated. In fact we compare the quadratic hedging 
strategies in a martingale setting for a claim when two models for the underlying 
stock price are considered. The first model is a geometric Lévy process in which 
the small jumps might have infinite activity. The second model is a geometric Lévy 
process where the small jumps are replaced by a Brownian motion which is appropri- 
ately scaled. The hedging strategies are related to solutions of backward stochastic 
differential equations with jumps which are driven by a Brownian motion and a 
Poisson random measure. We use this relation to prove that the strategies are robust 
towards the choice of the model for the market prices and to estimate the model risk. 


Keywords Lévy models - Quadratic hedging * Model risk - Robustness : BSDEJs 
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1 Introduction 


When jumps are present in the stock price model, the market is in general incomplete 
and there is no self-financing hedging strategy which allows to attain the contingent 
claim at maturity. In other words, one cannot eliminate the risk completely. However 
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it is possible to find ‘partial’ hedging strategies which minimise some risk. One way 
to determine these ‘partial’ hedging strategies is to introduce a subjective criterion 
according to which strategies are optimised. 

In the present paper, we consider two types of quadratic hedging strategies. The 
first, called risk-minimising (RM) strategy, is replicating the option's payoff, but it is 
not self-financing (see, e.g., [19]). In such strategies, the hedging is considered under 
a risk-neutral measure or equivalent martingale measure. The aim is to minimise 
the risk process, which is induced by the fact that the strategy is not self-financing, 
under this measure. In the second approach, called mean-variance hedging (MVH), 
the strategy is self-financing and the quadratic hedging error at maturity is minimised 
in mean square sense (see, e.g., [19]). Again a risk-neutral setting is assumed. 

The aim in this paper is to investigate whether these quadratic hedging strategies 
(RM and MVH) in incomplete markets are robust to the variation of the model. Thus 
we consider two geometric Lévy processes to model the asset price dynamics. The 
first model (5;);epo,7] is driven by a Lévy process in which the small jumps might 
have infinite activity. The second model (S/);<[o0,7] is driven by a Lévy process in 
which we replace the jumps with absolute size smaller than £ > 0 by an appropriately 
scaled Brownian motion. The latter model (57 );ejo,| converges to the first one in 
an L?-sense when € goes to 0. The aim is to study whether similar convergence 
properties hold for the corresponding quadratic hedging strategies. 

Geometric Lévy processes describe well realistic asset price dynamics and are 
well established in the literature (see e.g., [5]). Moreover, the idea of shifting from a 
model with small jumps to another where these variations are represented by some 
appropriately scaled continuous component goes back to [2]. This idea is interesting 
from a simulation point of view. Indeed, the process (57 )rejo, r] contains a compound 
Poisson process and a scaled Brownian motion which are both easy to simulate. 
Whereas it is not easy to simulate the infinite activity of the small jumps in the 
process (S;)+e[0,7] (see [5] for more about simulation of Lévy processes). 

The interest of this paper is the model risk. In other words, from a modelling 
point of view, we may think of two financial agents who want to price and hedge an 
option. One is considering (S+)rejo,r] as a model for the price process and the other 
is considering (57);cjo, rj. Thus the first agent chooses to consider infinitely small 
variations in a discontinuous way, i.e. in the form of infinitely small jumps of an 
infinite activity Lévy process. The second agent observes the small variations in a 
continuous way, i.e. coming from a Brownian motion. Hence the difference between 
both market models determines a type of model risk and the question is whether the 
pricing and hedging formulas corresponding to (57);epo,r| converge to the pricing 
and hedging formulas corresponding to (5;);ejo,r| when e goes to zero. This is what 
we intend in the sequel by robustness or stability study of the model. 

Inthis paper we focus mainly on the RM strategies. These strategies are considered 
under a martingale measure which is equivalent to the historical measure. Equivalent 
martingale measures are characterised by the fact that the discounted asset price 
processes are martingales under these measures. The problem we are facing is that 
the martingale measure is dependent on the choice of the model. Therefore it is clear 
that, in this paper, there will be different equivalent martingale measures for the two 
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considered price models. Here we emphasise that for the robustness study, we come 
back to the common underlying physical measure. 

Besides, since the market is incomplete, we will also have to identify which 
equivalent martingale measure, or measure change, to apply. In particular, we discuss 
some specific martingale measures which are commonly used in finance and in 
electricity markets: the Esscher transform, the minimal entropy martingale measure, 
and the minimal martingale measure. We prove some common properties for the 
mentioned martingale measures in the exponential Lévy setting in addition to those 
shown in [4, 6]. 

To perform the described stability study, we follow the approach in [8] and we 
relate the RM hedging strategies to backward stochastic differential equations with 
jumps (BSDEJs). See e.g. [7, 9] for an overview about BSDEs and their applications 
in hedging and in nonlinear pricing theory for incomplete markets. 

Under some conditions on the parameters of the stock price process and of the 
martingale measure, we investigate the robustness to the choice of the model of the 
value of the portfolio, the amount of wealth, the cost and gain process in a RM 
strategy. The amount of wealth and the gain process in a MVH strategy coincide 
with those in the RM strategy and hence the convergence results will immediately 
follow. When we assume a fixed initial portfolio value to set up a MVH strategy we 
derive a convergence rate for the loss at maturity. 

The BSDEJ approach does not provide a robustness result for the optimal number 
of risky assets in a RM strategy as well as in a MVH strategy. In [6] convergence rates 
for those optimal numbers and other quantities, such as the delta and the amount of 
wealth, are computed using Fourier transform techniques. 

The paper is organised as follows: in Sect.2 we introduce the notations, define 
the two martingale models for the stock price, and derive the corresponding BSDEJs 
for the value of the discounted RM hedging portfolio. In Sect. 3 we study the stabil- 
ity of the quadratic hedging strategies towards the choice of the model and obtain 
convergence rates. In Sect. 4 we conclude. 


2 Quadratic Hedging Strategies in a Martingale Setting for 
Two Geometric Lévy Stock Price Models 


Assume a finite time horizon T > 0. The first considered stock price process is 
determined by the process L = (Lr)rejo,r] which denotes a Lévy process in the 
filtered complete probability space (42, .7 , IF, P) satisfying the usual hypotheses as 
defined in [18]. We work with the càdlàg version of the given Lévy process. The 
characteristic triplet of the Lévy process L is denoted by (a, b*, £). We consider a 
stock price modelled by a geometric Lévy process, i.e. the stock price is given by 
S, = Soel, Vt € [0, T], where Sy > 0. Let r > 0 be the risk-free instantaneous 
interest rate. The value of the corresponding riskless asset equals e"! for any time 
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t € [0, T]. We denote the discounted stock price process by $. Hence at any time 
t € [0, T] it equals . 

Si = e"! S, = Soe et. 
It holds that 


d$, = $ âdt + $,bdW, + Ji (e* — 1)Ñ (dt, dz), (1) 
Ro 


where W is a standard Brownian motion independent of the compensated jump 
measure N and 


. l5 . 
üá—ca-—-r-4zb*-4 (e — 1 — zlqa«1)) £(dz). 
2 Ro 


It is assumed that $ is not deterministic and arbitrage opportunities are excluded (cfr. 
[21]). The aim of this paper is to study the stability of quadratic hedging strategies in 
a martingale setting towards the choice of the model. Since the equivalent martingale 
measure is determined by the market model, we also have to take into account the 
robustness of the risk-neutral measures. Therefore we consider the case where P is not 
a risk-neutral measure, or in other words à Æ 0 so that Sisnota P-martingale. Then, 
a change of measure, specifically determined by the market model (1), will have to be 
performed to obtain a martingale setting. Let us denote a martingale measure which 
is equivalent to the historical measure P by P. We consider martingale measures that 
belong to the class of structure preserving martingale measures, see [14]. In this case, 
the Lévy triplet of the driving process L under P is denoted by (à, D?, Č). Theorem 
III.3.24 in [14] states conditions which are equivalent to the existence of a parameter 
© € R and a function p(z; ©), z € IR, such that 


f |z (o(z; ©) — 1)| £(dz) < oo, 2) 
ES 
and such that 


a=at+bo «f z(p(z; @) — l)£(dz) and @(dz) = p(z; O)£(dz). (3) 
{lz1<1} 


For $ to be a martingale under P, the parameter © should guarantee the following 
equation 

X " 1 : P 

f= a- r+ z+ f (e? — 1 —zlqze1y) £(dz) = 0. (4) 


0 


From now on we denote the solution of Eq. (4)—when it exists—by O9 and the 
equivalent martingale measure by Pø. Notice that we obtain different martingale 
measures Po, for different choices of the function p(.; Oo). In the next section we 
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present some known martingale measures for specific functions p(. ; 99) and specific 
parameters Oo which solve (4). 

The relation between the original measure I? and the martingale measure Po, is 
given by 


dPa, 
dP 


1 t T 
— exp (row, — 5h ev «f n log (p(z; @0)) N (ds, dz) 
0 JRo 


«f (log (o(z; @o)) + 1 — p(z; e»t). 
0 


From the Girsanov theorem (see e.g. Theorem 1.33 in [17]) we know that the 
processes W® and N® defined by 


dW, = dW, — b@odr, (5) 
N^» (dr, dz) = N (dt, dz) — p(z; Og)£(dz)dt = N (dr, dz) + (1 — p(z; Oo)) £(dz)dt, 


for all t € [0, T] and z € Ro, are a standard Brownian motion and a compensated 
jump measure under Pop. Moreover we can rewrite (1) as 


d$, = S,bdW, + $, | (e* — 1)N (dt, dz). (6) 
Ro 


We consider an -Fr -measurable and square integrable random variable Hr which 
denotes the payoff of a contract. The discounted payoff equals Hr = e" Hr. In 
case the discounted stock price process is a martingale, both, the mean-variance 
hedging (MVH) and the risk-minimising strategy (RM) are related to the Galtchouk- 
Kunita-Watanabe (GKW) decomposition, see [11]. In the following we recall the 
GKW-decomposition of the Zr -measurable and square integrable random variable 
Hr under the martingale measure Po, 


me A T q A - 
Âr = E^ Br] + f gnas, 4 2P, (7) 
0 


where E® denotes the expectation under Pe, , £ is a predictable process for which 
we can determine the stochastic integral with respect to S, and Z® is a square 
integrable Po, -martingale with E dd = 0, such that 7 is Po, -orthogonal to $. 

The quadratic hedging strategies are determined by the process £^. It indicates 
the number of discounted risky assets to hold in the portfolio. The amount invested in 
the riskless asset is different in both strategies and is determined by the self-financing 
property for the MVH strategy and by the replicating condition for the RM strategy. 
See [19] for more details. 
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We define the process 


^ Op 


V^ = E™[Ar|F;], Ve € (0, T]. 


which equals the value of the discounted portfolio for the RM strategy. The GK W- 
decomposition (7) implies that 


: AG d g e c 
Vv, = Ve «f £ dS, F E vte [0, T]. (8) 
0 


Moreover since #® is a Po, -martingale, there exist processes X® and Y® (z) 
such that 


T WE n XP^dqw?» + f | ¥(z)N%(ds,dz), Wee[0,T], (9) 
0 0 JRo 
and which by the Po, -orthogonality of 7% and $ satisfy 
x9op «dH Y?» (z)(e* — 1) (z; G9)£(dz) = 0. (10) 
0 
By substituting (6) and (9) in (8), we retrieve 
dv, = Gi o$, b+ um dw,” +f (6S, (e* — 1) + Y» (2)) No (dt, dz). 


Let 9o = g9o $ indicate the amount of wealth invested in the discounted risky asset 
in a quadratic hedging strategy. We conclude that the following BSDEJ holds for the 
RM strategy 


dV?» = Aawa + f B® (z) N® (dt, dz), mm 
Ro 


^ 6 ^ 
Vr 9 — Hr, 


where 
A9» = $9op.p X% and BÓ9(z) = f99(e* — 1) + Y” (z). (12) 


Since the random variable Hr is square integrable and r-measurable, we know by 
[20] that the BSDEJ (11) has a unique solution (Y 8o, A®%, 0, B®). This follows from 
the fact that the drift parameter of Ųų% equals zero under Po, and thus it is Lipschitz 
continuous. 
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We introduce another Lévy process L^, for 0 « & « 1, which is obtained by 
truncating the jumps of L with absolute size smaller than £ and replacing them by 
an independent Brownian motion which is appropriately scaled. The second stock 
price process is denoted by S^ = Spe" and the corresponding discounted stock price 
process $* is thus given by 


dS° = $*a,dr + S£bdW, + sf (e — 1) N (dt, dz) + $G(e)dW,, (13) 


(zie) 


for allt € [0, T] and a = So. Herein W is a standard Brownian motion independent 
of W, 


G? (s) T (e* — D)? £(dz), and (14) 
{lzl<e 


1 
à =a-r +5 (0? + G(e)) «f (e — 1 — zlyz<1}) £(dz). 


{lz|=e} 


From now on, we assume that the filtration F is enlarged with the information of 
the Brownian motion W and we denote the new filtration by F. Moreover, we also 
assume absence of arbitrage in this second model. It is clear that the process L^ has 
the Lévy characteristic triplet (a, b? + G? (e), 1]42234) under the measure P. 

Let P, represent a structure preserving martingale measure for $*. The charac- 
teristic triplet of the driving process L^ w.r.t. this martingale measure is denoted 


by (à.. b? + G?(e), ls}: From [14, Theorem III.3.24] we know that there exist a 


parameter © € R and a function p(z; ©), z € R, under certain conditions, such that 


f Iz(o(z; ©) — 1)| £(dz) < oo, (15) 

{e<lz|<1} 

äs =a + (0? di G(e)) O «f z (p(z; O) — 1) £(dz), and (16) 
{e<|z|<1} 

be (dz) = lpz|ze}(z; 9)£ (dz). (17) 


Let us assume that © solves the following equation 


1 " 
a-ri (+E) S (E —1-clyen) Ëz =0, — (18) 


Ro 


then $* isa martingale under P. From now on we indicate the solution of (18)—when 
it exists—as ©, and the martingale measure as Po, . 
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The relation between the original measure P and the martingale measure Po, is 
given by 


dPo, 
dP 


= 8 i 260,2 cQ Ww al 2 2 
g = xp (bO:W, — Zb°O0°t + GE)O W, — 5G*(@)O."t 


t 

+f J log(p(z; ©+))N (ds, dz) 
0 Jflzze) 

+f (log (o(z; ©:)) + 1 — plz; e))«2). 
(lle) 


The processes wes, We, and N® defined by 


dWÉ* = aw, — b@, dt, 
dwe: = dW, — G(e)@edt, 
N® (dt, dz) = N (dt, dz) — p(z; O-)€(dz)dt 
= N(dt, dz) + (1 — p(z; 0,))£(dz)dt, (19) 


for allt € [0, TJ andz e (ze R: |z| = £}, are two standard Brownian motions and 
a compensated jump measure under Po, (see e.g. Theorem 1.33 in [17]). Hence the 
process $* is given by 


dô? = $*baw,” + $° ji (e — 1)N® (dt, dz) + SPG(e)dWP*. — Q0) 


ES 


We consider an .Z 7 measurable and square integrable random variable H7. which 
is the payoff of a contract. The discounted payoff is denoted by Ht. Le Hr. The 
GKW-decomposition of lis. under the martingale measure Po, equals 


T 
HE = tin f £9d$5 +. gre, (21) 
0 


where ẸE®: is the expectation under Po,, &% is a predictable process for which 


we can determine the stochastic integral with respect to S*, and Z® is a square 
integrable Po, -martingale with E = 0, such that .Z/?* is Po, -orthogonal to $*. 
The value of the discounted portfolio for the RM strategy is defined by 


V9: =E°[AE|F,], vr e[0, T]. 


From the GKW-decomposition (21) we have 


t 
$9: = VP + f £9«d$* 4 Z, wr e [0, T]. Q2) 
0 
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Moreover since .Z/9* is a Po,-martingale, there exist processes X Oe Y9* (7). and 
Ze such that 


t : 7 t t i E 
go -f xerawe f I YO! (z)N% (ds, dz) «f Z9:aW8:, vr e[0, T]. 
0 0 J{lzlZe} 0 


zs (23) 
The Pe,-orthogonality of 7°: and $° implies that 


X95 +f Y% (z)(e* — 1) p(z; Oc) €(dz) + Z” Ge) = 0. (24) 
{Iz|2e} 
Combining (20) and (23) in (22), we get 


dV," = (gP Seb + xy) dW£* + f (E Sf (e — 1) + Y c)) N (ar, az) 
(zie) 


+ (E SP Ge) + zP) aiv. 


Let 7 = &% $* denote the amount of wealth invested in the discounted risky asset 
in the quadratic hedging strategy. We conclude that the following BSDEJ holds for 
the RM strategy 


dV, = AP awe" + J Bf @N® (dt, dz) + CP awe, 
{izi=} (25) 


where 


A9: = $9:b + X%, B9; = $9:(e* — 1)--Y8*(z), and (26) 
C*: =H" Ge) + Z”. 


Since the random variable Ha is square integrable and Fr-measurable we know 
by [20] that the BSDEJ (25) has a unique solution (Vo, A®%, BP, Ce), This 


results from the fact that the drift parameter of VO: equals zero under Po, and thus 
is Lipschitz continuous. 


3 Robustness of the Quadratic Hedging Strategies 


The aim of this section is to study the stability of the quadratic hedging strategies 
to the variation of the model, where we consider the two stock price models defined 
in (1) and (13). We study the stability of the RM strategy extensively and at the end 
of this section we come back to the MVH strategy. Since we work in the martingale 
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setting, we first present some specific martingale measures which are commonly 


used in finance and in electricity markets. Then we discuss some common properties 
which are fulfilled by these measures. This is the topic of the next subsection. 


3.1 Robustness of the Martingale Measures 


Recall from the previous section that the martingale measures Po, and Po, are deter- 
mined via the functions o(. ; O9), p(. ; ©) and the parameters Oo, Os, respectively. 
We present the following assumptions on these characteristics. 


Assumptions 1 For Oo, Os, o(.; Oo), and p(.; Os) satisfying Eqs. (2)-(4), and 
Eqs. (15)-(18) we assume the following, where C denotes a positive constant and 
O € (69, Oe}. 


(i) Oo and ©, exist and are unique. 
(ii) It holds that 7 
[8o — Oc| < CG*(s), 


where G(e) = max(G (e), o(€)). Herein o (£) equals the standard deviation of 
the jumps of L with size smaller than e, i.e. 


o*(e) = J z2t (dz). 
{lz|<e} 
(iii) On the other hand, ©, is uniformly bounded in e, i.e. 
[O| x C. 
(iv) For all z in {|z| < 1} it holds that 


lez; O)| < C. 


(v) We have 
| p^ (z; O)£(dz) < C. 
(li 1) 


(vi) It is guaranteed that 


n (1 — p; 8))^£(d2) < C. 
Ro 
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(vii) It holds for k € (2, 4] that 
f (Pz; @o) — pz; 6) £(dz) < CG^* (e). 
0 


Widely used martingale measures in the exponential Lévy setting are the Esscher 
transform (ET), minimal entropy martingale measure (MEMM), and minimal mar- 
tingale measure (MMM), which are specified as follows. 


e In order to define the ET we assume that 


| e?*((dz) «oo, VOER. (27) 
(1z]z 1) 


The Lévy measures under the ET are given in (3) and (17) where p(z; ©) = e?z, 
The ET for the first model is then determined by the parameter Oy satisfying (4). 
For the second model the ET corresponds to the solution ©, of (18). See [13] for 
more details. 

Let us impose that 


| e?-Dp(dz) zoo, WER, (28) 
{z121} 


and that o(z; ©) = e9 (€ -U in the Lévy measures. Then the solution Oo of 
Eq. (4) determines the MEMM for the first model, and ©, being the solution of 
(18) characterises the MEMM for the second model. The MEMM is studied in 
[12]. 

e Let us consider the assumption 


f e" ((dz) < oo. (29) 
(zz1) 


The MMM implies that p(z; ©) = O (e? — 1) — 1 in the Lévy measures and the 
parameters Oo and O, are the solutions of (4) and (18). More information about 
the MMM can be found in [1, 10]. 


In [4, 6] it was shown that the ET, the MEMM, and the MMM fulfill statements (1), 
(ii), (iii), and (iv) of Assumptions 1 in the exponential Lévy setting. The following 
proposition shows that items (v), (vi), and (vii) of Assumptions 1 also hold for these 
martingale measures. 


Proposition 1 The Lévy measures given in (3) and (17) and corresponding to the 
ET, MEMM, and MMM, satisfy (v), (vi), and (vii) of Assumptions 1. 


Proof Recall that the Lévy measure satisfies the following integrability conditions 
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J z)&(dz)-oo and J £(dz) < oo. (30) 
(Iz| 1) {z121} 


We show that the statement holds for the considered martingale measures. 


Under the ET it holds for © € {O0, Os} that 
Peja Pt «gt 


because of (iii) in Assumptions 1. By the mean value theorem (MVT), there exists 
a number ©’ between 0 and O such that 


(1— p; 8)) = ze9*e? < s (Ipa<ne 2? + Ls e 02) C, 


where we used again Assumptions 1 (iii). For k € {2, 4}, we derive via the MVT 
that 


k fz 
(P; Oo) — pi &))* = Po (1 — e0) — etze" — e, 
where ©” is a number between 0 and ©; — O9. Assumptions 1 (ii) imply that 
k 2 ES 
(oz; 09)—p(z @c)) < (a Oz 24 lüz- peer ner CG*(e). 
The obtained inequalities and integrability conditions (27) and (30) prove the 
statement. 
Consider the MEMM and © € (O9, Os}. We have 
p*(z; 9) = ef9 € -D << efCle 1 
because of (iii) in Assumptions 1. The latter assumption and the MVT imply that 


(1 — p(z; 0)? = (e — 129€ Ye? 
s (Ieee? + 1e yePCE-Y) c. 


We determine via the MVT and properties (ii) and (iii) in Assumptions | for 
k € (2, 4} that 


k 
(o(z; Oo) — p(z; Oe)) 
ENT (1 _ ee -ene-ny 


= eb ole) (e? — p kgko" E-D (O) — gy 


< [igne eee, 24 lene era) CG? (e) 
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From (28) and (30) we conclude that (v), (vi), and (vii) in Assumptions 1 are in 
force. 

e For the MMM we have 

pt: 9) = (Oe — 1) - 1)! < C(e* + 1). 
Moreover it holds that 
(1 - pz; 9)! = - 26? < (uc<ne?2” + Maie + )) e 


We get through (ii) and (iii) in Assumptions 1 that 


(P; 89) — PE; @e))* = (e? — D*(& — 0») 
< (tinez? + lug iy + 1) CG?' (e), 


for k € (2, 4}. The proof is completed by involving conditions (29) and (30). 


3.2 Robustness of the BSDEJ 


The aim of this subsection is to study the robustness of the BSDEJs (11) and (25). 
First, we prove the L?-boundedness of the solution of the BSDEJ (11) in the following 
lemma. 


Lemma 1 Assume point (vi) from Assumptions 1. Let (V2, A9». B®) be the solu- 
tion of (11). Then we have for all t € [0, T] 


T T T 
| / copas + | f atras |+ | f I Beroes] < CELA], 
t t t Ro 


where C represents a positive constant. 


Proof Via (5) we rewrite the BSDEJ (11) as follows 


dv, = (-ve0a x L B? — p(z; o(a) dt 
0 


T. apaw, + f B® (2) Ñ (dt, dz). 
R 


0 


We apply the Itó formula to ef’ (V22 and find that 
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d (Py) 


= peh! (V7 dt + 208 VA (oes + I B” (2) — plz; ovela) dt 
Ri 


0 
+ 2ePt VAP aw, + e! (4P?) at 


+f e^ (02 ao) - 8) Nar a f BPE aor 
Ro Ro 


By integration and taking the expectation we recover that 


[e^ cy 
rf 
= E [#7 (0)| - pE | I oF (0,00) 
t 
T 
| / efs 8o (eo + n B® (z)(1— pz; e»t) a| (31) 
t Ro 


T T 
= | / "age | - | J n ^ac. 
t t Ro 


Because of the properties 


l 
L2 


1 
for all a, b € R and k € Rọ it holds that + 2ab < ka? + -a (32) 


and 


n 2 n 
forall n € N and for all a; € R,i = 1,...,n we have that (Èa) < n> a, 
1 


i=l 


the third term in the right hand side of (31) is estimated by 
D ^ c > " 
— 2E | / es Vo (oe? «f BÉ»(z — p(z; e») as| 
t Ro 
T ^6 1 : 2 
<E / ePs k(9 9o)? 3E » (oe? «f B®(z)(1 — p(z; e»t) ds 
t Ro 
T - 2 T 
< KE | / ^c + Cb eyE | i efs (atas 
t k , 
2 T : 
T Ji (1 — p(z; 09)? ((dz)E U/ a) (B®? (cota: . 
k Jmo i Ro 


Substituting the latter inequality in (31) leads to 
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O02 T 35-6002 NES z 89.2 
E [e^ c; 0) ]*6-»& f ebs (9,245 | + (1- PLAT je f ebs (490)2ds 
t t 
2 2 d Bs e 2 
+ (: — J (1 — p(z; 09)) taz) E f e n (By ? (z))*£(dz)ds 
k Ro t Ro 


<E [^r ép? (34) 


Let k guarantee that 


is roel pii Ji Ut an td > 1 
—-— = an —-— — : =. 
k 3 tine oo “27 


Hence we choose 


k > 4max (eov. | (1— p(z; e» t2) > 0, 
0 


which exists because of (vi) from Assumptions 1. Besides we assume that B > 
k+ 5 > 0. Then for s € [0, T] it follows that 1 < e/* < eT and from (34) we 
achieve 


T T T 
E | | comas +E | | (aas +E | I [ Boaza] ep 
t t t J Ro 


which proves the claim. 


In order to study the robustness of the BSDEJs (11) and (25), we consider both 
models under the enlarged filtration F since we have for all t € [0, T] that F, C F+. 
Let us define 

V= VO e, A= A A, BPG) = BI) — Tuas B (o), 


We derive from (5), (11), (19), and (25) that 
dV, = afdt + A£dW, «f B* (z)N (dt, dz) — CP*aW,, (35) 
Ro 


where 
a? = —b(Q)A™ — 8,A9*) + G(e)O,C (36) 


+ [ (B® (2) (1 — p(z; 89)) — Lyzj>eyB (z) (1 — p(z; Oe))) £(dz). 
0 
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The process o (36) plays a crucial role in the study of the robustness of the 
BSDEJ. In the following lemma we state an upper bound for this process in terms of 
the solutions of the BSDEJs. 


Lemma 2 Let Assumptions 1 hold true. Consider a® as defined in (36). For any 
t € [0, T] and B € R it holds that 


T 
| J e^enas| 
' E T T 
«c(s*o| | J CIE | f efs ji Boaza] 
t t R 
T - T - j 
+ | J Apa] + | i efs n Giro 
t t Ro 
T 
+ | I che. 
t 


where C is a positive constant. 


Proof Parts (ii) and (iii) of Assumptions | imply that 


|—b (9949? — e, A?) 


< |bl|8o — «|| A?^| + |b]|G. || AP» — AP | 
< CG? (e)|AS| + CIAS] 


and 
IG(e)O,C*| < c|c£*|. 


From Hólder's inequality and Assumptions 1 (vi) and (vii) it follows that 
| f (BP) (1 = plz 69) = ls} BP G) (1 = ple; 92) (2) 
0 


«|f, 8e vc 60 - pe o» to] +| f. 8e a - o 0» tto) 
° 1/2 ° 1/2 

<([ (en - oov) (f Braa) 
Ro Ro 


2 1/2 - 1/2 
* ( n (1 — p(z; @e)) (a2) ( l Boao) 
Ro Ro 


= 1/2 - 1/2 
< CG?(e) ( n (Bota) +C ( J T l 
Ro Ro 
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We conclude that 


(ac <C (Ge [ae 4 n Baoa] 


0 


4 (AD + f (BOY (do) + ce») 
0 


The statement is easily deduced from this inequality. 


With these two lemmas ready for use, we state and prove the main result of this 
subsection which is the robustness of the BSDEJs for the discounted portfolio value 
process of the RM strategy. 


Theorem 1 Assumptions | are in force. Let (V2, A®, B®) be the solution of (11) 
and (Ve, A®:, pe, c9) be the solution of (25). For some positive constant C 
and any t € [0, T] we have 


4 Ẹ 
| (ge $us] * il (AQ? — Aya, 
? t 


r T 
ls l i (BP? (z) — leza BÉ 0) eas] LE U/ (c2)%4s] 
t 0 


< c (E[ (Ar - Bi] + GOELA). 


, 


Proof We apply the Itó formula to ef (vey 
d (e^ cz?) = pef! (VE) dt + 2e”! V*atdr + 2e! V? AtaW, — 2e? VEC? aW, 
+ ef! (A*y?dr + ef! (CO)? dt + n ef! (B£ (z))? t(dz)dt 
0 


«f ef! (c. + BR — (y?) N (dr, dz). 
Ro 


Integration over the interval [t, T] and taking the expectation under P results into 
= - T - T - 
7 [eco] =E [e^7 cie | — BE |/ eara: -2E U/ efs Tazas 
t t 
T - T - 
E | J e Anas] -E | J n e^ di yrs 
t t JRo 
T 
-E | li cor] ; 
t 
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or equivalently 
- T - 
nl [em coy | + x U/ "dear 
T S T 
+E | J f "dires +E | / "egy 
t R t 
- i T - T - 
=E [c7 ce] — BE p eaa] — 2E || es Vasa | 
a I - i T 
< Id + (k— B) Jh e^arras| dec | | apas], (37) 
t t 
where we used property (32). The combination of (37) with Lemma 2 provides 
i T - C T - 
a [e^ VA] + (6 — OE || eas] + (1 - 3 | | dias] 
(di Tg 
+ (: = 3L nE a (BP co? | + ( - £) | | co] 
t 
< Ud à aC Af oA) 4s] (38) 


T 
+ Jn ae) (aco? |. 
t Ro 
1 


Let us choose k and f such that 1 — C 25anddf-k- 1. This means we choose 


k > 2€ > OandB > $ +k > 0. Thus forany s € [t, T] itholdsthat 1 < eP* < ef". 
We derive from (38) that 


T - 
Lf ores ee 
T 
«s| [ f eo toas] + | f aee] 
Ro 

F 

pp?) e o [s if (aas + ‘|| [ eoe |). 
t Ro 


By Lemma 1 we conclude the proof. 


<C 


a 


This main result leads to the following theorem concerning the robustness of the 
discounted portfolio value process of the RM strategy. 


Theorem 2 Assume Assumptions 1. Let V%, V% be part of the solution of (11), 
(25) respectively. Then we have 
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J l sup (Vo — ey <C ( qr — ÂE)? + G^(e)E[RZ] 


O0<t<T 


for a positive constant C. 


Proof Integration of the BSDEJ (35) results into 


T T T T 
Ve = -j afas- [ iow, - f [ Bro Gs do + f CAW. 
0 


By property (33) we arrive at 
dX 
O0<t<T 
7 T T 2 

<5 E[ cp? «x * (of))ds | +E] sup (/ &av;) 
0 O<t<T \Jt 
M : T oui 

-E| sup (/ f io Ras ao) -E| sup (/ C; ai, . 
Oxr-T t Ro O<t<T t 


Burkholder’s inequality (see e.g., Theorem 3.28 in [15]) guarantees the existence of 
a positive constant C such that 


T 2 T | 
| sup (/ aw.) | <C ‘| / itas]. 
OxtxT t 0 
T 7 E 2 T - 
| sup ( / f Br os a) | <C | T f Gro eaoas. 
OxrxT \Jt JRo 0 JRo 
T _\2 T 
| sup (/ ctii.) | <C in eis]: 
OxrxT t 0 


Application of Lemma 2 for t = 0, 8 = 0, Lemma 1, and Theorem | completes the 
proof. 


3.3 Robustness of the Risk-Minimising Strategy 


Theorem 2 in the previous subsection concerns the robustness result of the value 
process of the discounted portfolio in the RM strategy. Before we present the stability 
of the amount of wealth in the RM strategy, we study the relation between 7 
(resp. 7%) and the solution of the BSDEJ of type (11) (resp. (25)) in the first 
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(resp. second) model. Consider the processes A® and B® (z) defined in (12), then 
it holds that 


Ap «f B® (z)(e* — 1) p(z; Oo) £(dz) 
Ro 


= 90b? + XH + l (2e — 1)? p(z; Oo) + Y” — Lolz; @o)) (dz) 
Ro 


= 7% D «f (e — 1? (z; epo] 
Ro 


+ X%p «f Y9»(z)(e* — 1) p(z; 09)£(dz). 
0 


From property (10) we attain that 
G 1 G 6 2 
g9 = — (4% + l B” Ze — Nok: aveta) ; (39) 
Ro 


where xo = b? + Jro (e? — D? p(z; O9)£(dz). Similarly for the second setting we 
have for the processes A%, B9* (z), and C?* defined in (26) that 


A9sp «f B®! (z)(e* — 1)p(z; O,)£(dz) + C?* G(e) 
{|z|>e} 
249 [o +f (e — 1? p(z; O,)£(dz) + ce) 
{|zl>e} 


+X%H4 f Y* (e - Deci eati) + ZG). 


(zie) 
Property (24) leads to 


: 1 z O 
ges a (4% + | B® (z)(e* — Dp; Oc)e(dz) + c*ae)) , (40) 
{lz|=e} 


Ke 
where x, = b? + fiaz č — 1) 0 (z; 0,)£(dz) + G?(e). 


We introduce the following additional assumption on the Lévy measure which we 
need for the robustness results studied later. 


Assumption 2 For the Lévy measure £ the following integrability condition holds 


/ e* ((dz) < oo. 
{z221} 
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Note that the latter assumption, combined with (30), implies for k € (2, 4} that 


f (e* — 1) 'Z(dz) < C (/ z2t(dz) «f £(dz) «f uaa) < oo. 
Ro TEST (zi 1) {c= 1} ai 


Moreover Assumption 2 is fulfilled for the considered martingale measures 
described in Sect. 3.1. Indeed, consider the ET, applying (27) for 0 = 4 and restrict- 
ing the integral over {z > 1} implies Assumption 2. On the set {z > 1} it holds that 
z < e* — 1 and therefore Assumption 2 follows from (28) by choosing 6 = 4. For 
the MMM, condition (29) corresponds exactly to Assumption 2. 


Theorem 3 Impose Assumptions 1 and 2. Let the processes *® and $1?* denote 
the amounts of wealth in a RM strategy. There is a positive constant C such that for 
any t € [0, T] 


T 
| f e&- #45] < c (Eli - A5 + G4 )ELARI). 
t 


Proof Consider the amounts of wealth in (39) and (40). Let us denote HO = m yo 
and £9: = 1°. Then it holds that 


2 
(22 _ gey « «(5 = z) (ry ES ly? m ar) 
K K 


[2 


Herein we have because of the Hólder's inequality, (14), (41), and properties (iv) and 
(vii) in Assumptions 1 that 


ko—= k)? 3 i 
( ) = (/ (e — 1? p(z; e»t) 
KOKe b {Izl<e} 


2 
4 ( f (e? — D? (p(z; 89) — plz: 9.42) 4 2 
{|z|=e} 


oe C (/ (e — n] 
(p {lel<e) 


«f (e* — ted) | (p(z; O9) — p(z; Ox))* £(dz) + G^(e) 
Ro Ro 


I^ 


I^ 


CG* (6). 
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On the other hand it is clear from (39) and (40) that 


pr? _ yey 
< 3 ((A*? b? + (C9? G? (e) 


i 2 
+ ( f (B9"(z)(e* — 1)p(z; Oo) — 1g») B® (z)(e* — 1) (z; 2 J 


Herein we derive via Hólder's inequality, (30), (41), and points (iv), (v), and (vii) in 


Assumptions 1 that 
2 
( ji (B® (z)(e* — 1)p(z; O0) — lizze} BO (z)(e* — D)o(z; e») 

2 
= ( f (B%(z)(p(z; 09) — p(z; Oc))(e* — 1) + B° ()o (z; Oe) (e7 — Dea») 
< (B?»(z)? (dz) | (p(z; 9) — p(z; OX)? (E — 1)7£(dz) 

Ro Ro 
«f Gr cea f o^ (z; Oe) (e* — 1)? £(dz) 
Ro Ro 


< f (B?»(z)? (dz) ( f (p(z; 89) — p(z; ottaa) ( e - vea) 
Ro Ro Ro 


1 


1 
3 


4 f (B° (z))? t(dz) ( n p^ (z; 8«)£(dz) e - otea) 
Ro HEHESI 


+C l (B® (z)) £(dz) f z?t(dz) 
Ro {|z|<1} 


«DG f (B® (z))? £(dz) + C f (B° (z))! £(dz). 
Ro Ro 


{lzl>1} 


The results above show that 
(-) ^ (9 2 = = (-) 
(s^ - a^) «c (an * a (BF OY edz) + (CP)? 
0 


t 
+ G4(e) [at f a ceo). 
0 
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Therefore 

T 
Di || (i o = sas] 

t 2 i . | 
<C ( z | J diras + | 1 | (B; cea: + | J ctas 
t + JRo 
d T 
o) | ff (agra | EN | / ri Cd l) l 
t " Ro 


By Lemma 1 and Theorem 1 we conclude the proof. 


The trading in the risky assets is gathered in the gain processes defined by p = 
h B ? d$, and Gee = h &P*q$s. The following theorem shows the robustness of 
this gain process. 


Theorem 4 Under Assumptions 1 and 2, there exists a positive constant C such that 
for any t € [0, T] 


zz RV) x - E 
: (6% - 6| < c (mir - Ap eoim). 
Proof From (5) and (6) we know that 
£%d8, = EP S baw + ES S, I (e* — 1)N (ds, dz) 
Ro 
=n, (Ga +f (e — D (L— p(z; ey» e(a) ds 
0 
es + [ e - nias a2). 
Ro 

In the other setting we have from (19) and (20) that 
£P: d$: = EP: SehdWO® + ERE SE J (e* — 1)N% (ds, dz) + E S eN" 

{|z|ze} 


= 40 ((-»e. «f (e — 1) (1 — p(z; @,)) £(dz) — c'e.) ds 
(Ize) 


+ baw, + | 


(zie) 


(e€ — DN(ds, dz) + yai; f 
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We derive from the previous SDEs that 
^ ^ t ^ t ^ 
or eg f Pd f £O: ase 
0 0 
t 
= (-2o+ | (e* — D (1 — p(z; ey eav) f Rds 
Ro 0 
t 
— (-ve. «f (e* — 1) (1 — p(z; @,)) £(dz) — cee.) | 7 ds 
{IzlEe} 0 
$ t 
+b f (&9* — &9-)aW, + f f (2e — 1) — #9 Leze — 0) Ñ (ds, dz) 
JO JO JRo 
t 
= co f f 9dW,. 
0 


Via the Cauchy-Schwartz inequality and the Itó isometry we obtain that 
m V 
| (62 2 Ge) | 
t 
sÈ ( z | li aera (^e + n (D - pG; e» e(a) 
0 Ro 


2 
= (-ve. «f (e — 1) (1 — p(z; @e)) £(dz) — c'e. )| 
(Ice) 


t 
+ | n (i go - #9+)as| 
0 


2 
x (-»e. «f (e€ — D (1 — p(z; O¢)) £(dz) — coe.) 
{Izl>e} 


t t 
+ | n (gp — aus] + G'(e)E | f Gas 
0 0 


ik 
i a i (&P (e* — 1) - i Lizel — DEC 
0 JRo 


wherein 


t 
£ n n (APE 1) - £P 1g gea (e? — DEC 
0 JRo 


t 
«2E | | I (GRE — DP gau + G -PPE — Deen) tto 
0 JRo 


t 
<2 ( f (e — D?4(dz)E l n (i go «| 
{Izl<e} 0 
t 
«f (e? — D? £(dz)E p (70 — yas) ; 
Ro 0 
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and 


f 
[frre] mf nra] mtra 
0 


By relation (14), Assumptions 1, (39), (41), Lemma 1, and Theorem 3 we prove the 
statement. 


The following result shows the robustness of the process .Z/? appearing in the 
GKW-decomposition. This plays an important role in the stability of the cost process 
of the RM strategy. 


Theorem 5 Let Assumptions 1 and 2 hold true. Let the processes £° and Z°: 
be as in (9) and (23), respectively. For any t € [0, T] it holds that 


a CZ — 971 < c (Bl Ar — ÊRI + ELAR), 


for a positive constant C. 


Proof By (5) we can rewrite (9) as 


dZ” = (sew t f YOU- p; e»t) dt 
R 


0 


+ xf^aw, + n Y,°°(z)N(dt, dz). 
R 


0 


and similarly by (19) we obtain for (23) 


qe = (-v0.x? + 1 Y” (z)(1 — pE; O:))Ldz) — coaz) dt 
{|z|=e} 


+ XÉP*dw, + | YP (z)N (dt, dz) + ZP*aW,. 


(ize) 


Hence we recover that 
O0 Os E ve 723 AT Gg aD 
dCZ, ? —.25*) = yr dt + X dW, «f Y; (z) N (dt, dz) — Z; *dW;, 
Ro 


where 
yE = —b(@oX™ — O,X%) + G(e)O,Z% 
+ ri (YOA — pz: 09) — 1s Y9* (2) — p; Oz) £(dz), 
0 


X* = XO — XO, 
Y*(z) = Y9*(z) — 194-4 Y ** (2). 
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By integration over [0, t] and taking the square we retrieve using (33) that 


t 2 t 
(2 — Z < c((/ yids) 4 (/ Xam.) 
0 0 


ZG — 


Concerning the term 


1 that 


iU 


2 


t = 2 t E 2 
+ (/ f jr Ras. a2) «(f z?ii.) ) 
0 JRo 0 


Via the Cauchy-Schwartz inequality and the Itó isometry it follows that 


um « C 


RM 


(0X? — e, x9) as| 


t t 
z p vias + T 12 
0 0 
t t 
+ | | f Pioda] + j| aya ; 
0 JRo 0 


D | i (y? Pas] we derive through (ii) and (iii) in Assumptions 


=| 
<c( 


t 
z p (Oo — ege + 
0 


t 
G^(e) | f atas] + | f 
0 0 


t 
ap) O (XK = xeu 


t 
aal) 


and via (vi) and (vii) in Assumptions 1 it follows that 


< n (o (z; 
Ro 


+ I (1 — p(z; 0) (dà) 
Ro 


<C (~o 


t 2 
D [ n (YP^(z) (1 — pE; 89)) — las Y?" (2) A — pE; 62) a} «| 
0 


t = 
Oo) — p(z; e» «a f l ar eyeaoys| 
0 0 


f 
| n n w! d + 
0 JRo 


| n n (vs d 
0 JRo 


t 
| n I (vs c»? |) 
0 JRo 
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Thus we obtain that 


VUE — 27] 


f t 
< C GO | U i aas] +E p f w! co*eaas || 
0 0 JRo 
t t 
0 0 JRo f 


(42) 


Let us consider the terms appearing in the latter expression separately. 


e Definition (12) implies that 


£ t I 
| | atas] <2 ( z i (aas t PE p aera) 
0 0 0 
and 


t 
1 | | areas 
0 JRo 
t t 
«2(s] n n Boadas | + f e - n*eaog[ f ceras. 
0 IRo Ro 0 


e Combining (12) and (26) in 


ve Oo 0. A6 ^ OO a Oe 
X =X, —X,^,—APj—(Qu —m,-)b, 


it easily follows that 


fro] ee (era ef ana) 
0 0 0 


e Similarly, from (12) and (26) we find 


F(z) = YPE) — YE) = BE(z) — GO — Hye — 1). 


Hence 


t 
e| [ [! PEOP as 
JO JRo 
t t 
< 2(E | f [i Git cota + n (e — D? £(dz)E | n (i 8» -ieya]). 
0 J Ro Ro 0 
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ZO x € (CP + &P — 8! a* 6) + HYG?) 


leads to 


+G? (e) 


t t t 
| f (z& cas <C ( z, | f ctas] + G?(s) | I p = 22a 
0 0 0 
t 
T cea 
0 


e Because of (39) and (vi) in Assumptions 1 we notice that 


[s t 
T eras] <C ( ) i (aos + 
0 0 


| i | (co? |) 
0 JRo 


Using (41) and the combination of the above inequalities in (42) show that 


t t 
ELL” — g^ y <C (Po fe | | (aas +E | n [i (ap di 
0 0 J Ro 


t t 
+E | f (aas +E | | Í Gr oaa: 
0 J0 JRo 
t t 
+E | | (chas +E | n aa — zo) 
JO 0 


Finally by Lemma 1 and Theorems 1 and 3 we conclude the proof. 


The cost processes of the quadratic hedging strategy for Hr, He are defined by 
K% = Z% + y and Ke = 9% + yrs The upcoming result concerns the 
robustness of the cost process and follows directly from the previous theorem. 


Corollary 1 Under Assumptions 1 and 2, there exists a positive constant C such 
that it holds for all t € [0, T] that 


Proof Notice that 


qc — KP] < c( 


BK? — KP] «2( 


EG - yl 


a (Ar — B5!) + G'ORLÁI). 


(Vo — $51) 
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wherein 


O0<t<T 


Ve? — 9921 <E l sup (VP — ver 


Theorems 2 and 5 complete the proof. 


3.4 Robustness Results for the Mean-Variance Hedging 


Since the optimal numbers £9? and £€* of risky assets are the same in the RM and 
the MVH strategy, the amounts of wealth 7° and 7% and the gain processes G% 
and G®: also coincide for both strategies. Therefore we conclude that the robustness 
results of the amount of wealth and gain process also hold true for the MVH strategy, 
see Theorems 3 and 4. 

The cost for a MVH strategy is not the same as for the RM strategy. However, 
under the assumption that a fixed starting amount Vo is available to set up a MVH 
strategy, we derive a robustness result for the loss at time of maturity. For the models 
(1) and (13), it holds that the losses at time of maturity T are given by 


T 
L9» = Îr — VW -f £Ooq$,, 
0 
T 
L9: = HE — Vo -f EPrd SE, 
0 


When Assumptions | and 2 are imposed, we derive via Theorem 4 that 


3 (L9 — Ly] < c (El(Ar — ÂP] + G?@)ELAF)), 


for a positive constant C. 

Note that we cannot draw any conclusions from the results above about the robust- 
ness of the value of the discounted portfolio for the MVH strategy, since the portfolios 
are strictly different for both strategies. 


4 Conclusion 


Two different geometric Lévy stock price models were considered in this paper. We 
proved that the RM and the MVH strategies in a martingale setting are stable against 
the choice of the model. To this end the two models were considered under different 
risk-neutral measures that are dependent on the specific price models. The robustness 
results are derived through the use of BSDEJs and the obtained L?-convergence rates 
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are expressed in terms of estimates of the form [CH — He )?]. The latter estimate is 
a well studied quantity, see [3, 16]. In the current paper, we considered two possible 
models for the price process. Starting from the initial model (1) other models could 
be constructed by truncating the small jumps and possibly rescaling the original 
Brownian motion (cfr. [8]). Similar robustness results hold for quadratic hedging 
strategies in a martingale setting in these other models. 

In [8] a semimartingale setting was considered and conditions had to be imposed 
to guarantee the existence of the solutions to the BSDEJs. In this paper however, 
we considered a martingale setting and, since there is no driver in the BSDEJs, 
the existence of the solution to the BSDEJs was immediately guaranteed. On the 
other hand, since the two models were considered under two different martingale 
measures, we had to fall back on the common historical measure for the robustness 
study. Therefore, a robustness study of the martingale measures had to be performed 
and additional terms made some computations more involved compared to the semi- 
martingale setting studied in [8]. 

In this approach based on BSDEJs we could not find explicit robustness results for 
the optimal number of risky assets. Therefore we refer to [6], where a robustness study 
is performed in a martingale and semimartingale setting based on Fourier transforms. 
Note that in [6] robustness was mainly studied in the L-sense and the authors noted 
that their results can be extended into L?-convergence, whereas L?-robustness results 
are explicitly derived in the current paper. 
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Risk-Sensitive Mean-Field Type Control 
Under Partial Observation 


Boualem Djehiche and Hamidou Tembine 


Abstract We establish a stochastic maximum principle (SMP) for control problems 
of partially observed diffusions of mean-field type with risk-sensitive performance 
functionals. 


Keywords Time inconsistent control - Maximum principle - Mean-field SDE - 
Risk-sensitive control * Partial observation 


AMS subject classification: 93E20 - 60H30 - 60H10 - 91B28. 


1 Introduction 


In optimal control problems for diffusions of mean-field type the performance func- 
tional, drift and diffusion coefficient depend not only on the state and the control but 
also on the probability distribution of the state-control pair. The mean-field coupling 
makes the control problem time-inconsistent in the sense that the Bellman Princi- 
ple is no longer valid, which motivates the use of the stochastic maximum (SMP) 
approach to solve this type of optimal control problems instead of trying extensions of 
the dynamic programming principle (DPP). This class of control problems has been 
studied by many authors including [1, 2, 5, 7, 15, 20]. The performance functionals 
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considered in these papers have been of risk-neutral type i.e. the running cost/profit 
terms are expected values of stage-additive payoff functions. Not all behavior, how- 
ever, can be captured by risk-neutral performance. One way of capturing risk-averse 
and risk-seeking behaviors is by exponentiating the performance functional before 
expectation (see [17]). 

The first paper that we are aware of and which deals with risk-sensitive optimal 
control in a mean field context is [24]. Using a matching argument, the authors 
derive a verification theorem for a risk-sensitive mean-field game whose underlying 
dynamics is a Markov diffusion. This matching argument freezes the mean-field 
coupling in the dynamics, which yields a standard risk-sensitive HJB equation for 
the value-function. The mean-field coupling is then retrieved through the Fokker- 
Planck equation satisfied by the marginal law of the optimal state. 

In a recent paper [11], the authors have established a risk-sensitive SMP for 
mean-field type control. The risk-sensitive control problem was first reformulated 
in terms of an augmented state process and terminal payoff problem. An intermedi- 
ate stochastic maximum principle was then obtained by applying the SMP of ([5], 
Theorem 2.1.) for loss functionals without running cost but with augmented state 
in higher dimension and complete observation of the state. Then, the intermediate 
first- and second-order adjoint processes are transformed into a simpler form using 
a logarithmic transformation derived in [12]. 

Optimal control of partially observed diffusions (without mean-field coupling) has 
been studied by many authors including the non-exhaustive references [3, 4, 8-10, 
13, 14, 16, 19, 21, 23, 26, 27], using both the DPP and SMP approaches. Reference 
[23] derives an SMP for the most general model of optimal control of partially 
Observed diffusions under risk-neutral performance functionals. Recently, Wang et al. 
[25], extended the SMP for partially observable optimal control of diffusions for risk- 
neutral performance functionals of mean-field type. 

The purpose of this paper is to establish a stochastic maximum principle for a 
class of risk-sensitive mean-field type control problems under partial observation. 
Following the above mentioned papers of optimal control under partial observation, 
in particular [23], our strategy is to transform the partially observable control problem 
into a completely observable one and then apply the approach suggested in [11] to 
derive a suitable risk-sensitive SMP. To the best to our knowledge, the risk-sensitive 
maximum principle under partial observation without passing through the DPP, and 
in particular, for mean-field type controls was not established in earlier works. 

The paper is organized as follows. In Sect. 2, we present the model and state the 
partially observable risk-sensitive SMP which constitutes the main result, whose 
proof is given in Sect.3. Finally, in Sect. 4, we apply the risk-sensitive SMP to 
the linear-exponential-quadratic setup under partial observation. To streamline the 
presentation, we only consider the one-dimensional case. The extension to the mul- 
tidimensional case is by now straightforward. Furthermore, we consider diffusion 
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models where the control enters only the drift coefficient, which leads to an SMP 
with only one pair of adjoint processes. The general Peng-type SMP can be derived 
following e.g. [11, 23]. 


2 Statement of the Problem 


Let T > 0 be a fixed time horizon and (42, ¥, F, IP) be a given filtered probability 
space on which there are defined two independent standard one-dimensional Brown- 
ian motions W = {W;}s>0 and Y = {Ys}s>o. Let FN and FY be the P-completed 
natural filtrations generated by W and Y, respectively. Set IF" := {FY ,O<s<T} 
and F := {.Y%,, 0 < s < T}, where, F, = FY V FY. 


We consider a mean-field type version the stochastic controlled system with partial 
observation considered in [23] which is an extension of the model considered by 
[4, 14] to which we refer for further details. 


The model is defined as follows. 


(i) An admissible control u is an IF" -adapted process with values in a non-empty 
subset (not necessarily convex) U of IR and satisfies EH |u(t)|*dt] < oo. We 
denote the set of all admissible controls by Y%. The control u is called partially 
observable. 


(ii) Given a control process u € WY, we consider the signal-observation pair 
(x", Y) which satisfies the following SDE of mean-field type 


dx" (t) = b(t, x" (t), E[x" (0)], u(t))dt + o (t, x" (t), Ex“ (0) Da W, 
+ a(t, x" (t), E[x" dW“, x"(0) = xo, (1) 
dY, = P(t, x"(t))dt - aW?, Yo = 0, 


where, 
b(t,x,m,u) : [O, T] x R x R x U — R, 


a(t,x,m), o(t,x,m): [0, T] x R x IR — R 
and (t, x) : [O, T] x R — R are Borel measurable function. 


In this model, the observation process Y , which carries out the controls u, is assumed 
to be a given Brownian motion independent of W and is supposed to admit a decom- 
position as a trend h B(t, x" (t))dt (a functional of the state process x") corrupted 
by a process W" which are a priori not observable. The case o = 0 corresponds to 
the model considered in [4, 14]. A more general model of the function 6 would be 
to let it depend on the control u and be of mean-field type. To keep the presentation 
simpler, we skip this case in this paper. But, the main results do extend to this case. 
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Before we formulate the control problem, we show that the system (1) has a weak 
solution. Introduce the density process defined on (2, F, IF, IP) by 


t 1 t 
p“ (t) := exp f p(s, x" (s))dY; — a IB Gs, x“(s))|7ds }, 
0 0 
which solves the linear SDE 


dp" (t) = pP" (ÐP, x" ())dY;,, p“) = 1. 


Assuming the function 6 bounded (see Assumption 1, below), p is a uniformly 
integrable martingale such that, for every p > 2, 


E[ sup p1zx C, Q) 
OxtxT 


where, C is a constant which depends only on the bound of f, p and T. Define 
dP” = p"(T)dP. By Girsanov's Theorem, IP" is a probability measure. Moreover, 


W“ is a IP"-standard Brownian motion independent of W. This in turn entails that 
(IP", x", Y, W, W”) is a weak solution of (1). 

The objective is to characterize admissible controls which minimize the risk- 
sensitive cost functional given by 


T 
J? (u(-)) = E" E (e p f(t, x" (t), E"[x" (0)], u(t) dt + h(x" (T), zr] : 
0 
where, 0 is the risk-sensitivity index, 


f(t,x,m,u) : [OL T] x R x R xU — R, 
h(x,m): R x IR — R. 


Any u(-) € Y which satisfies 


VUO = inf J'u) (3) 
u(-)e47 


is called a risk-sensitive optimal control under partial observation. 
Let Wr = d f(t, x(t), E"[x(t)], u(t))dt + h(x(T), E"[x(T)]) and consider 
the payoff functional given by 


~ 1 
V, i= T log E" e?" , 


When the risk-sensitive index 0 is small, the loss functional Wo can be expanded as 


0 
E"[Vr] + z var (Pr) + O (0), 
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where, var, (Vr ) denotes the variance of V7 w.r.t. IP". If < 0 , the variance of V7, 
as a measure of risk, improves the performance Øo, in which case the optimizer is 
called risk seeker. But, when 0 > 0, the variance of Wr worsens the performance 
Wp , in which case the optimizer is called risk averse. The risk-neutral loss functional 
E"[Wr.] can be seen as a limit of risk-sensitive functional Wo when 0 — 0. 


Since dP” = p" (T)dPP, the associated risk-sensitive cost functional becomes 


Ja) =E | p cree fx" (0), ELp" (0a Ete eee) l 
(4) 


where, on (£2, F, F, IP), the process (o", x") satisfies the following dynamics: 


dp" (t) = p"(t)B(, x" (s))dY;, 
dx" (t) = (bt, x" (t), E[x" ()], w(0) — a (t, x” (t), Ex" ODECE, x" (0)) at 
+ ot, x" (0), Ex" GQ)DaW, + a(t, x" (t), E[x" (0pdY;, 
p"(0) = 1, x"(0) = xo. 
(5) 


We have recast the partially observable control problem into the following completely 
observable control problem: Minimize J ? (1 (-)) defined by (4) subject to (5). 


The main result of this paper is a stochastic maximum principle (SMP) in terms of 
necessary optimality conditions for the problem (3) subject to (5). 


We will only consider the case where the risk-sensitive parameter is positive, 0 > 0. 
The case 0. < 0 can be treated in a similar fashion by considering 0 = —0,6 > 0, 
and f = — f, h = —h in the performance functional (4). 


We will make the following assumption. 


Assumption 1 The functions b, c, a, B, f, h are twice continuously differentiable 
with respect to (x, m). Moreover, these functions and their first derivatives with 
respect to (x, m) are bounded and continuous in (x, m, u). 


To keep the presentation less technical, we impose these assumptions although they 
are restrictive and can be made weaker. 

Under these assumptions, in view of Girsanov's theorem and [18], Proposition 1.2., 
for each u € WY, the SDE (5) admits a unique weak solution (p”, x"). 

We now state an SMP to characterize optimal controls 4(-) € % which minimize 
(4), subject to (5). Let (p, x) := (o*. x") denote the corresponding state process, 
defined as the solution of (5). 
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We introduce the following notation. 


__{P p eee E (Y 
oe). i: xv - Xv (1). B= ( 


òdi 
| 
Lom 
EI 


W,J 
_ (9 _ (eB, x) 0 
F(t, X,m, u) := a m, " j OU ms AA m) E , 


c(t, x, m, u) := b(t, x, m,u) — a(t, x, m)B(t, x), (X) := x, (X) := px, 
$(X) := x, G(X) := px. 


(6) 
We define the risk-neutral Hamiltonian as follows. For (p, q) € R? x R2*2, 
H(t, X, m, p,q, u) := (F(t, X,m, u), p) -tr(G" (t, X, m)q) — f (t, x, m,u), (7) 


where, ‘x’ denotes the transposition operation of a matrix or a vector. 


We also introduce the risk-sensitive Hamiltonian: (p,q, £) € IR? x IR?%? x IR? 


H?’ (t, X, m, u, p,q, ©) := (F(t, X,m,u), p) — f (t, xum, u) 


+tr(G*(t, X mq 40tp. ® 
We have H = H°. 


(^A . (P! EN ng) 
-(9. o= (2) (8) 


q21 922 


Setting 


the explicit form of the Hamiltonian (8) reads 


H?’ (t, X, m,u, p,q, ©) := c(t, x, m,u)po — f (t, x,m,u) + pB(t, x)(qui + 061 pi) 
+ a(t, x, m)(qzi + 062 pi) + o (t, x, m)(q22 + 0€» p2). 


Setting 0 = 0, we obtain the explicit form of the Hamiltonian (7): 


H(t, X,m,u, p,q) :— c(t, x,m,u)po — f(t,x,m,u) + oB(t, x)qu 
+ a(t,x,m)qoi + o (t, x, m)qoo. 


With the obvious notation for the derivatives of the functions b, a, B, o, f, h, w.r.t. 
the arguments x and m, we further set 


HÊ (t, X, m,u, p, q) := cx(t, x, m,u)pa — fx(t, x, m, u) + pf (t, x)(qii + 061p) 
+ Ox (t,x, m)(qo1 + 005 p1) + ox (t, x, m)(q22 + 0€» p2), 
Hy (t, X, m,u, p,q) :— €m(t, x, m, u) po + om(t, x, m)(qo1 + 6271) 
+ Om(t, x, m)(q22 + 00» p»), 
Hp (t, X, m,u, p,q) = B(t,x)(qii 8p). 
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With this notation, the system (5) can be rewritten in the following compact form 


dX(t) = F(t, X(t), El (X (0)], u(0)dt + GG, X(t), EIX (0) Da Bs, 


X(0) = Xo. (9) 


We define the risk-neutral Hamiltonian associated with random variables X such that 
$ (X) and p(X ) are L'(2, ¥, P) as follows (with the obvious abuse of notation): 
For (p, q) € IR? x R???, 


H(t, X, p,q, u) := (F(t, X, E[G(X)], u), p) — f(t, x, EIG(X)], u) (10) 
+tr(G*(t, X, El (X) Da). 


We also introduce the risk-sensitive Hamiltonian: (p,q, £) € IR? x R?*? x R?, 


H° (t, X, u, p,q, £) = (F(t, X, EIGQO], u), p) — f (t, x, EIGQO], u) 


+ tr(G*(t, X, EIOD + 6ep*)). SE) 


For g € (b, c, 0, o, B) and u € U, we set 


gx (t) = 8x (t, X(t), EIX], UO), gm) = Smt, x), EIXO] u()) 12) 


and 
fx) = fr, X(t), ELOX], 40), fm) :— fmit, xO, ELo(t)x(0], uo), 
hy(t) := hx (X(t), ELOX), has) := Am), ELo()xQ) D. 
(13) 
Let 


T. 
Wr := p(T) expe p ft, XE), E[o(t)x(0)], u(0)dt + A(T), EIDE], 


We introduce the adjoint equations involved in the risk-sensitive SMP for our 
control problem. 


dp(t) = — 


H? (t) + rae Ep 0) Ay (1 — 32 ED? (0) fa (] 
HE) + gi ED (Hg (] — ZO ty V(t) fm ON 

+ G(t)(—0e(t)dt + d B), 

dv? (t) = 0v? (r)(£(t), d Bi), (14) 


NEC ANLE 
BT) = ( A ) ( sa) A E[J S hs (T)]. 


v (T) = y$, 


where, in view of (2) and (13), for k € (p, x}, 


250 B. Djehiche and H. Tembine 


He (t) = (Ft, XO, EIOM, (0). BO) — felt, x), ELEA M, HO) 
+ (Get, XQ), EI (X (0)p GO + 6£p* (0) 


and 


Hp (t) = (Fn(t, XO), El) a0). BO) —— 
+ U(G; (t, X(t), El AADAC) + 6£p* (0). 


We note that the processes (P, q, £) may depend on the sensitivity index 0. To ease 
notation, we omit to make this dependence explicit. 


Below, we will show that, under Assumption 1, (14) admits a unique IF-adapted 
solution (p, g, $2, £) such that 


T 
al sup |p(t)/? + sup wor+ f (aie? +009?) «oo (15) 
te[0,T] te[0,T] 0 


Moreover, 
Lemma 1 The process defined on (2, F , F, P) by 
L? y? (t) 


t 02 t 
Pg e (/ 0(£(s), dB,) — 31 Ioas). O0<t<T, (16) 


is a uniformly integrable F-martingale. 


The process L° defines a new probability measure IP^ equivalent to P by setting 
L? = dv. By Girsanov's theorem, the process BP := B,- 0 i £(s)ds, 0 < 
t < T isa IP?-Brownian motion. 


The following theorem is the main result of the paper. Let E?[. ] denote the expec- 
tation w.r.t. P. 


Theorem 1 (Risk-sensitive maximum principle) Let Assumption 1 hold. If the 
process (p(-), x C), u(-)) is an optimal solution of the risk-sensitive control problem 
(3)-(5), then there are two pairs of F-adapted processes (8. £) and (P, ĝ) which 
satisfy (14)-(15), such that 


E?[H* (t, p(t), X (t), P(t), O, L), u) — H(t, A(t), ZA), P(N, AO), L), UO) 77 ] x 0, 


for all u € U, almost every t and P? —almost surely. 


Remark 1 The boundedness assumption of the involved coefficients and their deriv- 
atives imposed in Assumption 1, in Theorem 1, guarantees the solvability of the 
system of forward-backward SDEs (5) and (14). In fact Theorem 1 applies provided 
we can solve system of forward-backward SDEs (5) and (14). A typical example of 
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such a situation is the classical Linear-Quadratic (LQ) control problem (see Sect. 4 
below), in which the involved coefficients are at most quadratic, but not necessarily 
bounded. 


3 Proof of the Main Result 


In this section we give a proof of Theorem 1 here presented in several steps. 


3.1 An Intermediate SMP for Mean-Field Type Control 


In this subsection we first reformulate the risk-sensitive control problem associated 
with (4)-(5) in terms of an augmented state process and terminal payoff problem. 
An intermediate stochastic maximum principle is then obtained by applying the 
SMP obtained in ([1], Theorem 3.1 or [5], Theorem 2.1) for loss functionals without 
running cost. Then, we transform the intermediate first-order adjoint processes to a 
simpler form. The mean-field type control problem for the cost functional (4) under 
the dynamics (5) is equivalent to 


inf E [pce BO MI f (17) 
u()e47 


subject to 


dp(t) = p()BG, x()dY;, 
dx(t) = [b(t, x(t), E[x(()], u(0)) — a(t, xt), EDx (p GE, x@)} dt 
+ a(t, x(t), E[x() DdW, + a(t, x(t), E[x (t) DdY;, (18) 
dE = f(t, x(t), E[o(t)x@], u(t))dt, 
p(0) = 1, x(0) = xo, &(0) = 0. 


We introduce the following notation. 
X 5 Xo, 
(1). Ry = Ro = ( e 


I'(t, R, m) = tear ee 


$(R) = $(X) =x, $R) = (X) = px, (R) = G(X) =X, G(R) = G(X) = px. 


"S 
Il 
Jm ow Dd 
Il 
ne 
wie ox 
V 
I 
I 
Uri si DI 
I 
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With this notation, the system (18) can be rewritten in the following compact form 


_ (E, RO), EIGCR QD], uM) 
dR(t) = es R(t), a a T(t, R(t), E[G(RQ)) Da B;, 
R(0) — Ro 


and the risk-sensitive cost functional (4) becomes 
J^ UO) = Ele (R(T), EIRT), 
where, 
© (R(T), EIÓCRCD)D := p(T) exp @h(x(T), Elo (T) (T) + 687). 


We define the Hamiltonian associated with random variables R such that $(R) € 
L! (2, F, P) as follows. For (p, q) € IR? x R??, 


uc eai C RW), EIA RO), a | 


f(t, RE), E[GCR(Q))], ut) (19) 
T (7. R, E[6(8) Da). 
where, /'* denotes the transpose of the matrix T. 
Setting 
Pi q11 412 
P:=| p |. @:=| 4921 922 |], (20) 
P3 q31 432 


the explicit form of the Hamiltonian (19) reads 


H*(t, p, x, E, pq, u) := H*(t, R, p,g,u) = c(t, x, E[x], u) po + f (t, x, E[ox], u) pa 
+o (t, x, E[xqoo + pf(t, x)qui + a (t, x, E[x qoi. 


In view of (12), we set 


AN (t) :— cx (t) pa(t) + fx E) ps(1) + Ox ()q22(0) + PE) Bx G)aui() + ox (121), 
Ay, (t) :— Cm) p(t) + fm) p3(t) + om ()q22(0) + Am (t)q21 (t), 
HEO = PE, x) qu. 


We may apply the SMP for risk-neutral mean-field type control (cf. [1], Theorem 
3.1 or [5], Theorem 2.1) to the augmented state dynamics (p, x, &) to derive the first 
order adjoint equation: 
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HECE) + ELAS O] + X EL f(t) pA] 


dp(t) — — | HEM) + ELH£ O] + PELL (0 ps 0] | dt + ad Bi, 
0 
M—- - (21) 
(8p(T)) X(T) 
p(T) 2—0y9 | A(T) = )-O | A(T) | Ely hs (T). 
1 0 


This is a system of linear backward SDEs of mean-field type which, in view of 
([6], Theorem 3.1), under Assumption 1, admits a unique F-adapted solution (p, q) 
satisfying 


T 
1 sup |p(Df^ + | ofa eias, (22) 
0 


te[0, 7] 


where, | - | denotes the usual Euclidean norm with appropriate dimension. 


We may apply the SMP for SDEs of mean-field type control from ([1], Theorem 
3.1 or [5], Theorem 2.1) together with the SMP for risk-neutral partially observable 
SDEs derived in ([23], Theorem 2.1) to obtain the following SMP. 


Proposition 1 Let Assumption 1 hold. If (RC), a(-)) is an optimal solution of the 
risk-neutral control problem (17) subject to the dynamics (18), then there is a unique 
pair of F-adapted processes (p,q) which satisfies (21)-(22) such that 


E[H*(t, R(t), p(t), q(t), uw) — H*(t, R(t), p(t), q(t), (0) 77] < 0, 


for all u € U, almost every t and P—almost surely. 


3.2 Transformation of the First Order Adjoint Process 


Although the result of Proposition 1 is a good SMP for the risk-sensitive mean-field 
type control with partial observations, augmenting the state process with the third 
component & yields a system of three adjoint equations that appears complicated to 
solve in concrete situations. In this section we apply the transformation of the adjoint 
processes (p, q) introduced in [11] in such a way to get rid of the third component 
(p3, 431, 932) in (21) and express the SMP in terms of only two adjoint process that 
we denote (P, q), where 


Bis e T (2), distin dg. $21 (23) 
P2 q2 
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Indeed, noting that from (21), we have dp3(t) = (q3(t), d Bj) and p3(T) = —0 we, 
the explicit solution of this backward SDE is 


p3(t) = —0 ELY | Fi] = —0v? (t), (24) 


where, 
V: E[y | Al, O<t <T. 


In particular, we have v0) 2 E [we ]. Therefore, in view of (24), it would be natural 
to choose a transformation of (p, q) into an adjoint process (p, q) , where, 


Pi ji i2 
P= | pr}. d:]| $n ân |. 
P3 931 932 
such that i 
à pat 
t) = =-l, O<t<T. 25 
p3(t) ES Ste (25) 


This would imply that, for almost every 0 < t < T, 


d3(t) = (d31(0), d32(0)) = 0, P— a.s., (26) 


which in turn would reduce the number of adjoint processes to those of the form 
given by (23). 


We consider the following transform: 


" 1 
P(t) ara O<t<T. (27) 
In view of (21), we have 
0p)! x(T) 
p(T) =— hx(T) — | p(T) — El Wp ha (T)]. (28) 
1 o Jr 


We should identify the processes @ and q such that 


df(t) = —à(t)dt + Q(d B,, Q9) 


for which (25) and (26) are satisfied. 
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In order to investigate the properties of these new processes (p, à), the following 
properties of the generic martingale v? , used in [11], are essential. We reproduce them 
here for the sake of completeness. Since, by Assumption 1, f and / are bounded by 
some constant C > 0, we have 


0 < e 0* DC) < y? < el +106 (p. 


Therefore, v? 


is a uniformly integrable IF-martingale satisfying 
Qe Oey eK un, spen 


Hence, in view of (2), we have 


E[ sup WPO < C. (30) 
OxtxT 


Furthermore, the martingale v? enjoys the following useful logarithmic transform 
established in ([12], Proposition 3.1) 


T 
v^ (t) = exp (oz. Ww of f(s, x(s), ELos)x (5)], ids) » 0stzT, GL 
0 


and 
v (0) = Elw7] = exp(8 Zo). 


Moreover, the process Z is the first component of the lF-adapted pair of processes 
(Z, £) which is the unique solution to the following quadratic BSDE: 


dZ, = —(f (t, x (t), EloG)xG)], W(t) + $40 )dt + (0), d Br), 
(32) 
Zr = 4 ln p(T) + hGr, EL(T)x (T). 


where, £(t) = (£1(t), £2(t)) satisfies 


T 
JI eoar] < oo. (33) 
0 


In particular, v? solves the following linear backward SDE 


dv? (t) = 6 (DU), dBi), v (T) = wh. (34) 
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Hence, 


Proof of Lemma 1. In view of (30), 


v? (t) 
v? (0) 


t 2 pt 
= exp (/ 9(£(s), dBs) — =f 1(0)Pas) = L?, O<t<T, (35) 


is a uniformly integrable F-martingale. 


To identify the processes & and q such that 
dp(t) = —à(t)dt + q(t)d Bi, 


we may apply Itó's formula to the process p(t) = 0v? p(t), use (21) and (34) and 
identify the coefficients. We obtain 


Hé(t) + EUR (0 + FEL in) P30) 
A(t) = gg | HE) + ELE) + BOEUS Ops) | + TOLONEN 


4() = zra — OPON). 
Therefore, 


Hg) + EL; O] + xG) ELfm (0 pO] 
dp) = —g | HEO) + ELAS] + HOLL Sn) p3()] | dt + (O4 BP, 


0 
Gt) = Fama — (OU), 
dv? (t) = 0v (t) (£(t), d Bj), 
(0p(T))^! X(T) 
BT) =- h) - 2 y; Eh (T), 


VT) es. 
(37) 


where, B? := B,-0 Jo £(s)ds, Ox t € T, whichis, in view of (35) and Girsanov's 


: : o 
Theorem, a ]P? -Brownian motion, where SES = IL. 
dP | t 


In particular, 


d p3(t) = (q(t), —0£()dt + dB), p3(T) = —1. 
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Therefore, noting that p3(f) := [0»? (0)]7! p3(t) is square-integrable, we obtain 

^ P'ra T 

p3(t) = E [p3(T)| Ar] = —1. 


Thus, its quadratic variation becomes [3 Ida (t) dt = 0, P^ — as. This implies 
that, for almost every 0 < t < T, q3(t) = 0, P? and P — a.s. 


Hence, we can drop the last components from the adjoint processes (p, q) and only 
consider (keeping the same notation) 


zc m ^. (nu 41 
di ai ER Gaal 


for which (37) reduces to the risk-sensitive adjoint equation: 


sey s —1_( P + ELAS C1 - SO EU OI —" 
^P = wn e + ELIE 01 — a TAARE 
40) = gaga) — OPOLA, 
dv (t) = 0v? (E)E), d By), (38) 
seh x (COU) | >) (90) EET 
pT) = -( h,(T) ) — b yz Elvin), 
VT) = yy. 


In view of the uniqueness of lF-adapted pairs (p, q), solution of (21), and the pair 
(v8. £) obtained satisfying (33) and (34), the solution of the system of backward 
SDEs (38) is unique and satisfies (15). 


3.3 Risk-Sensitive Stochastic Maximum Principle 


We may use the transform (27) and (36) to obtain the explicit form (11) of the 
risk-sensitive Hamiltonian H? defined by 


H^ (t, X (t), P(t), ÂO), (0), u) = H*(t, R(t), pQ), qQ), u), (39) 


Ov? (t) 


where, H* is defined by (19). 
Let 


8H*(t) :— H° (t, R(t), p(t), q(t), u) — H° (t, R(t), p(t), q(t), a(t) 
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and 
8H*(r) = H° (t, X(t), PO, q(). £0), u) — H° (t, X (0), PO), GO), €), a). 
We have 
E[8H* OIF] = 0EM? OEH? DIFE] = Ov’ (0) EP [8 H? (t)|.F ], 


where, we recall that v? (t)/v? (0) = L? = dP? /dPP| z. . 


Now, since 6 > 0 and v? (0) = E[v.] > 0, the variational inequality (1) translates 
into 


E?[H* (t, p(t), X (t), p(t), O, L), u) — H? C, A(t), X(t), PO, AO), L), u()L 77] x 0. 


for all u € U, almost every t and IP —almost surely. This finishes the proof of 
Theorem 1. 


4 Illustrative Example: Linear-Quadratic Risk-Sensitive 
Model Under Partial Observation 


To illustrate our approach, we consider a one-dimensional linear diffusion with expo- 
nential quadratic cost functional. Perhaps, the easiest example of a linear-quadratic 
(LQ) risk-sensitive control problem with mean-field coupling is 


inf 2, P u? (dt $x? (T)+uE"[x(T)| 
u()e 


subject to 

dx(t) = (ax(t) + bu(t)) dt + odW, + ad W”, 
dY, = Bx(t)dt +dW*, 

x(0) = X0, Yo = 0, 


where, a, b, a, D, u and o are real constants. 


In this section we will illustrate our approach by only considering the LQ risk- 
sensitive control under partial observation without the mean-field coupling i.e. (u = 
0) so that our result can be compared with [8] where a similar example (in many 
dimensions) is studied using the Dynamic Programming Principle. The case u Æ 0 
can treated in a similar fashion (cf. [11]). 
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We consider the linear-quadratic risk-sensitive control problem: 


inf y(.jeY Enlil eode ta] 
ut- , 


subject to 

dx(t) = (ax(t) + bu(t)) dt + cdW, + ed W^, (40) 
dY, = Bx(t)dt + dw, 

x(0) = xo, Yo = 0, 


where, a, b, a, B and o are real constants. 


An admissible process ((-), x(-), u(-)) satisfying the necessary optimality condi- 
tions of Theorem | is obtained by solving the following system of forward-backward 
SDEs (cf. (5) and (14)) (see Remark 1, above). 


dp(t) = Be(t)x(t)dY,, 
dX(t) = {cx(t) + bu(t)) dt + odW, + adY,, 


0 
dpt) = — (25) dt + q(t) -0t (tdt + d Bj), 
dv? (t) = 9v? (6), d Bj), (41) 


| ((0p(T)! 
p(T) --( I(T) i 
VD) = E. 
p 21 x 


Y; [A p qui 912 
c:—a-—ap, B,:— E 5 = zq i= ; 
PR ee (2) d ( 1 q21 922 


98 = gare o Pedro] 


(0) — xo, 


and the associated risk-sensitive Hamiltonian is 


H? (t, p, x,u, p, q, £) = (cx + bu) py — ju? + ppx(q + 66 p1) 


42 
-Fa(qo1 + 05 pi) + o (q2 + 0€» p2). (42) 


In general the solution (v’, £) primarily gives the correct form of the process £ which 
may be a function of the optimal control u. Inserting £ in the BSDE satisfied by 
(p. q) in the system (41) and solving for (p, q), we arrive at the characterization the 
optimal control of our problem. 
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For the LQ-control problem it turns out that by considering the BSDE satisfied by 
(v?, £), we will find an explicit form of the optimal control u. Indeed, by (31), this 
is equivalent to consider the BSDE satisfied by (Z, £): 


dZ, = —(5i?() + $AG)P3ar + (£0. 4 By), 
Zr = } ln (T) + 1x2. 
Since u is FY, the form of Zr suggests that we characterize u and £ such that 
0 gY 0 y (t) z2 1 = GY 
E'(Z;|F7; ]=E£ E * @ + gj PRU tue] Ost <T, 
where, y and 7 are deterministic functions such that y(T) = 1 and (T) = 0. In 
view of the SDEs satisfied by (p, x) in (41), applying It6’s formula and identifying 


the coefficients, we get 


£1) = (ay (0) + B/0)x(t), A(t) =oy (x(t) (43) 
and 


E*[5 (y (t) + 2(c + oB)y (t) + (6(6? +07) — b*) 7) x2 (27 ] 
HE HA + 40? 4- o?) (t) + ((t) + by (0x (0)? 7] = 0. 


Hence, 


V(t) + 2(c + aB)y (t) + (6(0? +07) - b)y?(r) 20, y(T) 2 1, 


a(t) +42 +y) 20, n(T) =0, 4) 


where, the first equation is the risk-sensitive Riccati equation, and 
E IEE) + by (0x0); ] = 0. 
By the conditional Jensen's inequality, we have 
r- - zY 2 rr- - 2| GY 
E"[u(t) + byOxOlFA; ]| x ETUE by OOV |-7, ]. 
Therefore, the optimal control is 
H) = —by (DE KOF], (45) 

and the optimal dynamics solves the linear SDE 


dx(t) = (cic = Py OE OIF ]) dt +odW,;+adY;, x(0)=x0, (46) 
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where, by the filter equation of Theorem 8.1 in [22], 7r; (x) := E° [x (t)| FY ] is the 
solution of the SDE on (Q2, F, F, IP?): 


t t 
m (E) = xo + n (c — b?y(s))m5(&)ds + l (a + 60v +p) [r6 - 269 |) a? 
0 0 


where, Y? = Y, — fo @wy(s) + B); G)ds is an (2, F, F” , P^)-Brownian motion. 
Inserting the form (43) of £ in the BSDE satisfied by (p, q) in the system (41) and 
solving for (p, q), we arrive at the same characterization the optimal control of our 
problem, obtained as a maximizer of the associated H® given by (42). We sketch the 
main steps and omit the details. 


We have 
Hf —bpi—u, He = Bx(qu+6lipi), He = cp + Bo(qu + 66 pi). 
The BSDE satisfied by (p, q) then reads 


dpi(t) = —{qur@) (Bx) + 0610)) + 004) pi) x(t) + qi2 (£2 (0))) dt 
+ qu(0dY, + qi2(t)dW,, 
dp2(t) = — epa) + Be) (qui) + 061) pi(t))} dt 
+ O(qai€1(t) + q22l2(t))dt + qai(t)dY; + qo2(t)dW;, 
pT) = -gm P2(T) = —3(1). 
(47) 


In view of Theorem 1, if u is an optimal control of the system (40), it is necessary 
that 
E? [bp»(t) — (0.2 ] = 0. 


This yields 
u(t) = bE®[po(t)| F/I. 


The associated state dynamics x solves then the SDE 


dx(t) = [eso + b E*Lp 271] dt + odW, + adY,. 


It remains to compute BE’ pat). ]. Indeed, inserting the form (43) of £ in the 
BSDE satisfied by (p, q) in the system (47), by Itó's formula and identifying the 
coefficients, it is easy to check that (pi (t), q11 (t), qi2(£)) given by 


B x(t) 


pit) := qu(t) := 6 BO’ qi2(t) := 0 


EE 
AON 


solves the first adjoint equation in (47). Furthermore, since p(T) = —x (T), setting 
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E*[p3(0).2] = -AHE OIF], 


where, A is a deterministic function such that A(7) = 1, and identifying the coeffi- 
cients, we find that A satisfies the risk-sensitive Riccati equation in (44). Moreover, 


qai (t) = —0X(t), q(t) = —a(t). 


By uniqueness of the solution of the risk-sensitive Riccati equation in (44), it follows 
that A = y. Therefore, 


E'(pOlF ] = -VOE ROA), gi) =-0yt), gaat) = —ay (t). 
Summing up: the optimal control of the LQ-problem (41) is 
EO) = —by ODE KOLF], (48) 
where, y solves the risk-sensitive Riccati equation 
V(t) + 2c  o)y() + Oo? +a- by AOS, y(T)=1. 49) 


The optimal dynamics solves the linear SDE 
dx(t) = GO — by (r) E [$0127 1) dt +odW, +adY;, x(0)— xo, (50) 


and the filter zr, (X) := E?’ [x (ZY] is solution of the SDE on (22, F, F, p^): 


t 


t 
mG) = xo + [ (c — by Gyms Gyds + I (+ Gay (t) +p) [r6 — 26) |) a? 


where, Y? = Y, — fo (0ay (s) + Bt; Gods is an (2, F, FY , IP^)-Brownian motion. 
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Risk Aversion in Modeling of Cap-and-Trade 
Mechanism and Optimal Design of Emission 
Markets 


Paolo Falbo and Juri Hinz 


Abstract According to theoretical arguments, a properly designed emission trad- 
ing system should help reaching pollution reduction at low social burden based on 
the theoretical work of environmental economists, cap-and-trade systems are put 
into operations all over the world. However, the practice from emissions trading 
yields a real stress test for the underlying theory and reveals a number of its weak 
points. This paper aims to fill the gap between general welfare concepts underly- 
ing understanding of liberalized market and specific issues of real-world emission 
market operation. In our work, we present a novel technique to analyze emission 
market equilibrium in order to address diverse questions in the setting of risk-averse 
market players. Our contribution significantly upgrades all existing models in this 
field, which neglect risk-aversion aspects at the cost of having a wide range of sin- 
gularities in their conclusions, now resolved in our approach. Furthermore, we show 
both how the architecture of an environmental market can be optimized under the 
realistic assumption of risk-aversion. 


Keywords Emission markets * Social burden * Environmental policy * Risk aver- 
sion * General equilibrium * Risk-neutral measure 
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1 Practice of the EU ETS 


A properly designed emission trading system should help reducing pollution reduc- 
tion with low social burden. 

In this paper we understand it as a burden to the society, caused by energy pro- 
duction. We assume that it can be measured in monetary units including both, the 
overall production costs and an appropriately quantified environmental impact of 
energy production. 

Originated from this idea, and based on the theoretical work of environmental 
economists, cap-and-trade systems have been put into operations all over the world. 

The problem of design optimization for emission trading schemes has been 
addressed in [4]. This work shows that, in general, a traditional architecture of 
environmental markets is far from being optimal, meaning that appropriate alter- 
ations may provide significant improvements in emission reduction performances at 
lower social burden. Such improvements can be achieved by extending a regulatory 
framework, which we address below as extended scheme. 

Let us explain this. 

In the traditional scheme, it is assumed that the administrator allocates a pre- 
determined allowance number to the market and sets a compliance date at which a 
penalty must be paid for each unit of pollutant not covered by allowances. Hence, 
the policy maker can exercise merely two controls, the so-called cap (total amount of 
allowances allocated to the market) and the penalty size. In theory, a desired pollution 
reduction can be reached at some costs for the society by an appropriate choice of 
these parameters. However, in practice, there is not much flexibility, since the cap 
is motivated politically and the penalty is determined to provide enough incentives 
for the required pollution reduction. As a result, the performance of the traditional 
scheme could be very poor in terms of social burden for the achieved reduction. 

In an extended scheme, the policy maker has much more influence. The regulator 
can tax or subsidize the production in terms of monetary units or in terms of emission 
certificates. These additional controls can be implemented in a technology-sensitive 
way. Doing so, the merit order of technologies can be changed significantly. On this 
account, emission savings, triggered by certificate prices, also become controllable. 
The work [4] illustrates that, by an appropriate choice of additional controls, the 
market can reach a targeted pollution reduction at much lower social burden. 

Although these theoretical findings are sound, intuitive, and practically important, 
the optimization of environmental market architectures could not be brought to a level 
suitable for practical implementation. There are two reasons for this. 


(1) The existing approach [4] is based on the unrealistic assumption that each of 
the market players is non-risk-averse in the sense that it realizes a linear utility 
function. This assumption is not conform with the modern view and creates 
a number of singularities in the model. A priori, it is not even clear which 
conclusions of this work do hold under risk-aversion. 


Risk Aversion in Modeling of Cap-and-Trade Mechanism ... 267 


(2) Although the practical advantage of such market design optimization is obvi- 
ous, policy makers hardly can use the theoretical findings of [4], because their 
quantitative assessment requires optimal control techniques whose numerics is 
difficult. 


In this work, we address both issues, namely: 


(1^) We assume a non-linear utility function for market agents and show several 
properties of the market equilibrium which make market design optimization 
possible. With this, our model is brought in line with standard economic theory 
and is appropriate for further developments. We also emphasize that, to capture 
risk-aversion, a completely new argumentation has been developed. 

(2") We provide our study in a one-period setting. Being accessible without optimal 
control techniques, the results become evident and potentially usable for a broad 
audience, including practitioners and decision makers. 


The paper is organized as follows. Section2 discusses the literature developed 
concerning markets of emission certificates. In Sect. 3 we introduce our equilibrium 
model. Section4 deepens the analysis of the equilibrium. Section5 studies the social 
optimality of the equilibrium and proves that it corresponds to the overall minimum- 
cost policy under a risk-neutral probability distribution. Section 6 discusses some 
perspectives of optimal market design. The final Sect. 7 provides conclusions. 


2 Theory of Marketable Pollution Rights 


The efficiency properties of environmental markets have been first addressed in 
[6. 10], who first advocated the principle that the "environment" is a good that 
can not be “consumed” for free. In particular, Montgomery describes a system of 
tradable certificates issued by a public authority coupled with fixing a cap to the total 
emissions, and, doing so, to force polluting companies paying proportionally to the 
environmental damage generated by their production activity. An emission certificate 
is representative of the permission to emit a given quantity of pollutant without being 
penalized. Companies with low environmental impact can sell excess certificates and 
the resulting revenue represents a general incentive to reduce pollution. Montgomery 
shows that the equilibrium price for a certificate must be driven by the cost of the 
most virtuous company to abate its marginal unit of pollutant. The key result of his 
analysis is that such a system guarantees that the reduction of pollution is distributed 
among the companies efficiently, that is minimizing their total costs. 

After the seminal analysis of Montgomery, which is based on a deterministic and 
static model, the following research has taken the direction to the stochastic and 
multi-period settings. A literature review on the research which has developed after 
Montgomery's work can be found in [14]. A common result shared by all the analyses 
developed so far is that cap-and-trade systems indeed represent the most efficient way 
to reduce and control the environmental damage generated by the industrial activity. 
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Let us mention the contributions which are directly related to our analysis. A 
majority of relatively recent papers [1—5, 11, 13] are related to equilibrium models, 
where risk-neutral individuals optimize the expected value of their profit or cost 
function. The hypothesis of risk-neutrality of the agents is gracefully assumed in 
those contributions, since it significantly simplifies the proof that environmental 
markets are efficient. Some papers have considered explicitly risk-averse decision 
makers. One of them, [9], develops a pricing model for the spot and derivative pricing 
of environmental certificates in a single-period economy. In [7], the authors also 
develop a (multi-period) equilibrium pricing model for contingent claims depending 
on environmental certificates, where risk-averse agents maximize the expected utility 
of their profit function. 


3 One-Period Equilibrium of Emission Market 


To explain the emission price mechanism, we present a market model where a finite 
number of agents, indexed by the set 7, is confronted with abatement of pollution. 
The key assumptions are: 


e We consider a trading scheme in isolation, within a time horizon [0, T], without 
credit transfer from and to other markets. That is, unused emission allowances 
expire worthless. 

e There is no production strategy adjustment within the compliance period [0, T]. 
This means that the agents schedule their production plans for the entire period 
[0, T] at the beginning. Allowances can be traded twice: at time t = 0 at the 
beginning and at time t — T immediately before emission reports are surrendered 
to the regulator. 

e For the sake of simplicity, we set the interest rate to zero. 

e Each agent decides how much energy to produce and how many allowances to 
trade. 


Note that this one-period model is best suited for our needs to explain the core 
mechanism of market operation and to discuss its properties. A generalization to a 
multi-period framework is possible, but it gives no additional insights related to the 
goal of this work. 

The ith agent is specified by the set &! of feasible production plans for the 
generation of energy (electricity) within one time period from ¢ = 0 to t = T. 
Further, we consider the following mappings, defined on &', for each agent i € I: 


& e VIEI), C. ET GO. 


with the interpretation that for production plan &) € &', the values Vi E), (0i i EÒ, 
and Ej (£o) stand for the total production volume, the total production costs, and the 
total carbon dioxide emission, respectively. 
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Production: At time t = 0, each agent i € I faces the energy demand Do € R+ of 
the entire market, the realized electricity price Py € R+, and the emission allowance 
price ay € R}. Bae on this information, each agent decides on its a aia plan 
Ele !, where Z' is the set of feasible production plans. Given tle , at time T, 
agent in the total production costs, 


Cie) € R, (1) 


the production volume 


Vi (&b €R, (2) 


and the total revenue, Po vi (&), from the electricity sold. 


Allowance allocation: We assume that the administrator allocates a pre-determined 
number y, € [0, oo[ of allowances to each agent i. 


So far, we have introduced deterministic quantities. Let us pd turn to uncertainties 
modeled by random variables on the probability space (2, F, P). 


Emission from production: Following the production plan El, the total pollution of 
agent i is expressed as Ei, 5. 


Remark (Randomness in demand and production) The question of randomness in 
energy demand and production deserves a careful argumentation. The reader may 
be confused by the assumption that in our one-period modeling, the time unit may 
correspond to the entire compliance period (which suggests a rather long time), such 
that our assumption on deterministic demand and unflexible production schedule 
appears unrealistic. To ease understanding, one shall imagine an artificial emission 
market model for short time period, say one day until compliance. The point of our 
proposal is that the elements, the arguments and the techniques required to define the 
optimal production plan on a daily basis are the same of those required to identify 
the plan &) over a generalized period [0, T]. The value of this toy model is that it 
allows a straight-forward generalization to the multi-period situation. In our one- 
period modeling, we assume that the nominal energy demand Dy is non-random 
and is observed at the time ¢ = O when production decisions are made. We also 
suppose that the production plan & of each agent is deterministically scheduled at 
time f = 0. This view is in line with the current practice in energy business, where 
a nominal energy production volume along with a detailed schedule of production 
units is planed non-randomly in advance. Of course, the realized energy consump- 
tion deviates from what has been predicted. However, based on our experience in 
energy markets, it does not make sense to include this random factor into equilibrium 
modeling, since all decisions are made on the basis of a non-random demand antici- 
pation and non-random customer's requests for energy delivery. To maintain energy 
consumption fluctuations in real-time, diverse auxiliary mechanisms are used. They 
can be considered as purely technical measures (security of supply by reserve mar- 
gins). For this reason, we believe that it is natural to assume that, although the energy 
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demand Dy is known and production plan &) is deterministically scheduled at time 
t = 0, the total emission, associated with this production can not be predicted with 
certainty at time t = 0 when the production and trading decisions are made. In fact, in 
practice the producers have to manage diverse source of randomness while following 
production, (demand fluctuation, outages of generators) which yields usually small 
but unpredictable deviations N ! from the nominal emission E) (&b associated with 
production plan &. Thus, let us agree that E a (&b is modeled as a random variable 
given as a sum 

ELED = EED +N, eg, icl (3) 


with deterministic function 
E:S oR, the ELGD 


describing the dependence of the nominal emission on the production plan and a 
random variable N' standing for the deviation from the nominal emission. Note that 
the random emission £7. (55) will be the only source of uncertainty in our model. 


To ease our analysis, let us agree on the natural assumption that for production 
schedules &) € E; i € I the total market emission >°,., E7(&) possesses no point 
masses: 


PO E (E) =z)=0  forallz eR. (4) 


iel 


Allowance trading: At times t = 0, T the allowance permits can be exchanged 
between agents by trading at the prices Ao and Ar, respectively. Denote by Vå, 97. 
the change at times t = 0, T of the allowance number held by agent i € I. Such 
trading yields a revenue, which is 


— vÅ Ao — 9L Ar. (5) 


Note that vå and Ao are deterministic, whereas vi and Ar are modeled as random 
variables. Observe that sales are described by negative values of i, vi, therefore 
(5) is non-negative random variable, if permits are sold. 


Penalty payment: As mentioned above, the penalty z € [0, J-oo[ must be paid at 
maturity T' for each unit of pollutant not covered by allowances. Given the changes 
at times t = 0, T due to allowance trading, i.e. vå and vi, the production éi, and 
the total number Yó of allowances allocated to agent i € 7, the loss of agent i due to 
potential penalty payment is given by 


n (El) — 9) — oF — yi)”. (6) 


Individual profit: In view of (1)-(6), the profit of agent i € 7 following trading and 
production strategy (0', ') = (09, r, £g) depends on the market prices (A, P) = 
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(Ao, Ar, Po) for allowances and energy and is given by 


L^ (9!, E) = ah Ag — OF Ar — CAE + Povi Ei) 
=r (ET (i) — 0) — 01 — yi)”. 


Note that the individual profit could be negative. 


Risk-aversion and rational behavior: Suppose that the risk attitudes of each agent 
i € I are described by a pre-specified strictly increasing utility function U! : R > R. 
With this, the rational behavior of the agent i is targeted on the maximization of the 
functional 


(04, E) EU (L^? (9! , £))) 


over all the possible trading and production strategies (2^, £^) = Wi, vi, Ei). 


Energy demand: Suppose that at time t = 0 all agents observe the total energy 
demand, which is described by Do € R+. Let us agree that the demand must be 
covered. 


Market equilibrium: Following standard apprehension, a realistic market state is 
described by the so-called equilibrium—a situation where all allowance prices, all 
allowance positions, and all production decisions are such that each agent is satisfied 
by its own policy and, at the same time, natural restrictions are fulfilled. 


Definition 1 Given energy demand Do € R+, the prices (A*, P*) = (Aj, Aj. 
Pj) are called equilibrium prices, if, for each agent i € J, there exists a strategy 
(9/*, gi*) = (9i*. de a) such that: 
(i) the energy demand is covered 
> vii) = Do, 
iel 
(i) the emission certificates are in zero net supply 
»3 or =0 almost surely for? = Oandt = T, (7) 
iel 


(ii) each agent i € J is satisfied by its own policy in the sense that 


ECU (LA^! oP, £y) > EULA el, ey (8) 


holds for any alternative strategy (9^, £'). 


The main objective of this section is to prove that in the present model the electricity 
price formation is determined by the usual merit order arguments, where the effect of 
emission regulation causes emission allowance prices to enter the costs of production 


272 P. Falbo and J. Hinz 


at the specific emission rate. This issue can be considered as the core mechanism of 
any cap-and-trade system, since including pollution costs into final product prices 
causes a change of the merit order of production technologies towards a cleaner 
production. To formulate this result, let us elaborate on the opportunity costs and 
introduce additional definitions. 


Opportunity costs: In the economic literature, they stand for the forgone benefit 
from using a certain strategy compared to the next best alternative. For example, the 
opportunity costs of farming own land is the amount which could be obtained by 
renting the land to someone else. Let us explain how the opportunity costs necessarily 
lead to windfall profits. 

When facing energy (electricity) generation, producers consider a profit, which 
could be potentially realized when, instead of production, unused emission allowances 
were sold to the market. For instance, if the price of the emission certificates is 12 
€ per tonne of CO2 and the production of one Megawatt-hour (MWh) emits two 
tonnes of CO2 (say, using a coal-fired steam turbine), then the producer must decide 
between two strategies which are equivalent in terms of their emission certificate 
balance: 


e produce and sell one MWh to the market, 
e do not produce this MWh and sell allowances covering two tonnes of CO2. 


In this situation, the opportunity costs of producing one MWh are 2 x 12 = 24 €. 
Obviously, the agent produces energy only if the first strategy is at least as profitable 
as the second one. Thereby, both the production and the opportunity costs must be 
considered in the formation of the electricity market price. Clearly, if the production 
costs of electricity are 30 € per MWh, then the energy will be produced only if its 
price covers both the production and the opportunity costs. Thus electricity can only 
be delivered at a price exceeding 30 +2 x 12 = 54 €. That is, in order to trigger the 
electricity production, the opportunity costs must be added to the production costs. 

In the scientific community, this phenomenon is well-known under the name of 
cost-pass-through. An empirical analysis, see [12] confirms that the strategy of cost- 
pass-through is currently followed by the European energy producers. Furthermore, 
the detailed investigation of mathematical market models shows that the cost-pass- 
through is the only possible strategy in the so-called equilibrium state of the market. 
This can be interpreted as follows: when behaving optimally, the energy producers 
must pass the allowance price on to the consumers. Note that the producer obtains a 
windfall profit of 24 € in anycase: if electricity price is higher than 54, by passing- 
through the price of the certificates (that he has received for free); if the price is less 
than 54, by selling 2 certificates at 12 € on the emission market. 

More importantly, it turns out that the cost-pass-through is nothing but the core 
mechanism responsible for the emission savings. Namely, due to the opportunity 
costs, clean technologies appear cheaper than emission-intense production strategies. 
For instance, the alternative generation technology represented by a gas turbine, 
which yields energy at the price of 40 € and emits only one tonne of CO2, hardly 
competes with a coal-fired steam turbine under generic regime (without emissions 
regulation). Namely, if there is no regulatory framework, then the coal-fired steam 
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turbine is scheduled first and the gas turbine has to wait until the energy demand 
can not be covered by coal-fired steam technologies. However, given an emission 
regulation, the opposite is true: say, if the allowance price is equal to 12 € per tonne 
of CO2 as above, then the gas technology appears cheaper, operating at full costs 
of 40 4- 1 x 12 — 52 €. Thus, the gas turbine is scheduled first, followed by the 
coal-fired steam turbine, which runs only if the installed gas turbine capacity does 
not cover the energy demand. 

In the next section, we will show that the only rational behavior in equilibrium 
is to pass the opportunity costs on to the consumers. For this, we require additional 
notions. 


Definition 2 Consider a given energy amount d € R+ and a given allowance price 
ae R+. 
(1) Introduce the individual opportunity merit order costs of agent i € I as 


€! (d, a) = inf (ChE) + aEQGD : & e £', Vi(é >d}. 


An individual production plan &) € &' is called conform with opportunity costs at 
emission price a € IR, if 


€ (Vi b, a) = COED + aElGD. 


that is to say &) is confirm if it minimizes the production and emission costs among 
all the alternative plans offering the same generation and given emission price a. 
(i1) Introduce the cumulative opportunity merit order costs as 


€(d,a) = inf{ >“ (Ch) + aEj(&)) :  € 3, ied, Y vue d 


iel iel 


The production plans & € &',i € I, are called conform with opportunity costs at 
emission price a € R+ if 


ED VIE). a) =D (GED + «Ey&b). 
iel iel 
(iii) Any price p € Rx with the property that 
— (d, a) + pd x —(d,a) + pd for alld € Ry (9) 


is referred to as an opportunity merit order electricity price at (d, a) and it can be 
understand as the marginal cost for the entire generation sector given the level of 
demand d and emission price a. 
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4 Properties of Equilibrium 


With these definitions, we now show that, within any equilibrium, the production 
plans are always conform with opportunity costs. Furthermore, the equilibrium elec- 
tricity price is always an opportunity merit order price. 


Proposition 1 Given energy demand Dp, let (A*, P*) = (Aj, Az, Po) be the equi- 
librium prices with the corresponding strategies (3'*, £'*), i € I, then the following 
points hold: 

(i) For each agent i € I, the individual production plan &* is conform with oppor- 
tunity costs at emission price Aj: 


v (Vi (E5"), A) = CAEI) + AS EG GS). (10) 


(ii) The market production schedule &l*, i € I, is conform with opportunity costs at 
emission price Aj: 


VOS V5 59. AD = D (CEE) + ASEQ 69 - (11) 


iel iel 
(iii) Py is an opportunity merit order price at (ic, Vi (EN); AQ). 


The direct economic consequence of this mathematical result is that each individual 
will organize its own production strategy by scheduling power production units in 
an increasing price order. Thereby their variable costs (which include the opportu- 
nity costs of using the emission certificates) are considered. Within such a schedule, 
a demand d is satisfied by gradually turning on the most economic plants, until a 
generation level matching d is reached. Furthermore, the above proposition states 
that such schedule is reached not only on the individual level, but also for the entire 
market. Namely, an overall demand d is satisfied by gradually turning on the most eco- 
nomic plants until reaching a production which covers the demand d. Such aggregate 
ordering is usually called merit order, for this reason we call «€! (d, a) and € (d, a) 
(agent's i) opportunity merit order costs and cumulative opportunity merit order 
costs, respectively. 

It is worth noticing that the opportunity merit order electricity price as defined in 
(9) is equal to the marginal cost of generating electricity when the level of demand is 
d given certificate price a. Coupling this property with the merit order production in 
the electricity sector, implies that the most expensive production in the plan E will 
determine the marginal cost at demand level d and emission price a. The opportunity 
merit order electricity price at (d, a) is defined as the lowest price, which is able to 
trigger the required production level. 


Proof (i) Consider the equilibrium strategy (Di. &*) of agenti € I. Assume that the 
agent deviates from this strategy following an alternative production plan éi egi 
However, to keep the same emission credit balance, the difference Eù (&b — Bi, (&*) 
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is traded at the market in addition to as. That is, we change the equilibrium trading 
strategy (QU, gr to an alternative trading strategy Wå, UE) given by 


Di = Di + Ebi) — ELi, oi = ot. 


Note that we have changed only the initial position, from oe to n whereas the 
final position is the same vi ES oe. A direct calculation shows that the profit of this 
alternative strategy (97, i) = (a, ae &) can be written as 


LA Phi gg. Bye LA Phi qi E) + R(&. Ei”), 
i.e. it differs form the original profit L^ (9/*, £/*) by the amount 
R(&j, &j') = Pg (V9 E0) — Vo 69) + (C959) — C969) + AG (Eo 9) — Eo Éo). 
Note that R(&, &*) can not be positive, since otherwise 
L^ Phi cgi, gs LAP i gie. gi*) 


would yield 


«Ui (LAP ag, &)) > (ut (LE P (g!* giyy 


thus contradicting the optimality of the equilibrium strategy (2/*, £/*) (see (8)). Now, 
from Ri, £5") < 0 we conclude that 


— CÌ EÈ) — AGEN (ES) + PAVE (ER) = -cheb — AR EK (ES) + PRVAG) — 02) 


for each & € &!. With this, we conclude the desired assertion (10) as follows: 
any alternative production plan Ei which produces an energy amount Vi EÒ at least 
equal to Vi (£ġ*) must satisfy 


Ci (&) + ASEM GO < Ch(ES) + ASE EÀ). 
Thus, 
CIEE) + ASE} ER) = int(Ch Ei) + A$ELGD : & e &', VEED = viai». 


(ii) Summing up (12) over i € J, yields, for arbitrary choices of &) EH iel, 
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-D (GEP + AEG) + P$ VEG 
iel icl 
>- D (CGD + AsE Gb) + PF DWE). (13) 


icl iel 


From this, we conclude that, for any choice & € El,i € I, of production plans 
satisfying 


> VG) = Y e. 
icl iel 
it holds 
D (CGS) + A565) < D (CHES) + ASEDED)), 
ier icl 
implying the desired assertion (11). _ 
(iii) We need to prove that, for any d € R+, 
—€ (d, Ag) + Pod < -EÈ vii), AD + PED Vo EG"). 
iel iel 
For each choice of production strategies &) € &',i € I, estimate (13), combined 
with (11), gives 
-EO Vi (ES), A + PID Vio? 
iel iel 


>- D (GED +E) + Pf iG. 


iel iel 
In particular, if the strategies are chosen from 
(Gier: e E, ier M Vj) zd} 
iel 
then it holds that 


-EO VES, A + PS SL VIG = -D (CHE) + ASESGD) + Pid. 
icl iel iel 
Passing on the right-hand side of this inequality to 

V (d, A5) = inf($ (COE + ASEQQ) : & € E, iel, 3 V6 = ah, 


iel iel 
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yields the desired assertion 


-EO Vy). AD + Po >” Vo EG") = —V (d, AD) + Pjd. 


iel iel 


Remark The statement (ii) of the above proposition characterizes equilibrium in 
terms of aggregated quantities. Once the equilibrium is reached, the production 
schedule represents the cheapest way to satisfy the demand. From this perspective, 
the reader may conclude that the equilibrium production schedule can be obtained 
as a production plan which minimizes the overall costs among those which cover 
a given demand, indicating that only aggregated quantities do influence the equi- 
librium. However, we shall emphasize that the equilibrium still heavily depends on 
individual ingredients (such as initial endowments and risk aversion), which enter 
through the initial allowance price. 

Now, we show another natural property of the equilibrium allowance prices. It 
turns out that there is no arbitrage allowance trading and that the terminal allowance 
price is digital. 


Proposition 2 Given energy demand Dy, let (A*, P*) = (Ap, A7. Po) be the equi- 
librium prices with the corresponding strategies (0'*, &'*), i € I,. It holds: 

(i) There exists a risk-neutral measure Q* ~ P such that A* = (At, A7) follows a 
martingale with respect to Q*. 

(ii) The terminal allowance price in equilibrium is digital 


* A . 
AT = loy. ELEH- y207 (14) 


Proof (i) According to the first fundamental theorem of asset pricing, see [8] in 
discrete-time setting, the existence of the so-called equivalent martingale measure 
satisfying Að = zQ" (Ar) is ensured by the absence of arbitrage. Fortunately, in 
our framework, the absence of arbitrage follows from the equilibrium notion, as we 
show next. We thus conclude (i) of the above theorem and it remains to verify that the 
equilibrium rules out all arbitrage opportunities for allowance trading. Let us follow 
an indirect proof, assuming that vo is an arbitrage allowance trading, meaning that 


P(vp(AF — Ag) = 0) =1, P(w(AT — A5) > 0) > 0. (15) 


Based on this we obtain a contradiction by showing that each agent i can change its 
own policy (2/*, £/*) to an improved strategy (2', £/*) satisfying 


z (v' (aa, g™)) <E (va^ d, £*)) . (16) 


The improvement is achieved by incorporating arbitrage vo into the allowance trading 
of each agent i as follows: 


i vi“ + vo, OF = OF — v. 
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Indeed, the revenue improvement from allowance trading is 
—8lA$ — OLAS = —0) AS — 0LAT. + vol AT. — AD), 
which we combine with (15) to see that 
P (an, git) < DC) =1, P (140%, 8) e Dp dn) sei 


which implies (16), therefore contradicting the optimality of (9/*, £/*). 
(ii) From equilibrium property (8), it follows that for almost each w € «2 the terminal 
allowance position adjustment 27 (w) is a maximizer on R to 


z> Ab) — n (ELEF œ) — 00 — yi =z)". (17) 


Note that a maximizer of this mapping exists only if 0 < Až (œ) < x. That is, the 
terminal allowance price in equilibrium must be within the interval A7. € [0,7] 
almost surely. Let us show now that the price actually attains only boundary values 
almost surely, i.e. 


AT €(0,z] almost surely. (18) 


Suppose that an intermediate value A}.(w) €]0, 2[ is taken, then the unique 
maximizer of function (17) is attained on E A GC) — p" — yj This implies that 
vi (w) = Ei, (&5*)(o) — a — Yó holds for each i € J, and a summation over i 
yields 


> PFO) => (ErENO) - oit - €) = D EEN) — vo. 
iel iel je 


Note that equilibrium property (7) ensures that the random variable on the left-hand 
side of the above equality is zero almost surely. Thus, the inclusion 


(o :AT. €]0, m1} € (o : S Ep (59) — yo = 0} (19) 
iel 


holds almost surely. Because of (4), the probability of the event on the right-hand 
side of the above inclusion is zero, which shows (18). 

If A7.(o) = 0, then a maximizer 91* (a) to the function (17) is attained on 
[Et Ei w) — 9i — Mis -Foo[. Hence 


[o :A*% = 0} € (o : EL (E) — of — yi < oF 


holds almost surely for each i € 7, which implies that 
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(o :A$ = 0} € (o: 3 ERE — yo € >) OF} 
iel iel 
holds almost surely. Now, because of the equilibrium property (7), we obtain the 
almost sure inclusion 


(o :Aj = 0) € (0: S EREE) — yo < 0). 
iel 
Since the probability of A7. €]0, 7 [ is zero (19), we conclude for the complementary 
event that 


(o :A} = 1) 2 (o: S EREI) — yo = 0) (20) 


iel 


holds almost surely. Let us show the opposite inclusion. If Až (œ) = x, then a 
maximizer pP (c) to function (17) is attained on ] — co, E 3 Eio) — vi“ — y. 
Hence, 


(o :A} = 2) € (Er (89) — 99 — yo = OF 
holds almost surely for each i € Z, which implies that 
(o A} = 1) € (o: >) ELE) - Yo = > OF) 
iel iel 
holds almost surely. Now, because of the equilibrium property (7), we obtain 
(o :A} = 1) € (o: X EREE) — Yo = 0]. (21) 
iel 


Finally, combine inclusions (20) and (21) to obtain assertion (14). 


5 Social Optimality 


To formulate social optimality, we require additional notations. Given production 
strategies él € &',i e I, we denote the overall market production schedule by 
& = (&b icy and introduce the total production costs Co, the total production volume 
Vo, and the total carbon dioxide emission Erg and the total nominal carbon dioxide 


emission, defined by 


Coko) = >) ChE), VEo) = >) VV EG), Er (Go) = M E Ei, Eo(Eo) = M EED. 


iel iel tel iel 
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Having in mind that Co(£o) stands for the overall costs of the production and inter- 
preting zt (Er (£9) — yo)* as a proxy of the environmental impact of the production 
schedule &, let us agree that 


B(&o) = Co(&o) + x (Er (£o) — yo)* 


expresses the social burden caused by the overall production plan £y € x;e; E'. 
It turns out that the equilibrium strategy minimizes the social burden among all 
production strategies which cover a given demand. 


Proposition 3 Given energy demand Do, let (A*, P*) = (Aj, Az, PJ) be the equi- 
librium prices with the corresponding strategies (0'*, £'*), i € I. Let Q* bea 
risk-neutral measure whose existence is shown in Proposition 2. Then 


E (BER) < EZ" (BE)) (22) 


holds for each production schedule &) = (&Diei € xier Zİ which yields at least the 
same production volume, Vo(&o) = Vo(&p) = Do. 


Proof For each convex function f : IR — R, x — f(x), itholds f(x) -- V f (x)h < 
f(x +h), h € R, where V f (x) stands for one of the sub-gradients of f at the 
point x. In particular, for convex function f : R > Ri, x > x*, we obtain 
x* +150) < (x+ h)* forall x, h € R. With the equilibrium production strategy 
& = (&")ier, we conclude that 


(Er (&) — vo)" + Ler (&)—yo>0} (Er (50) — Er ($9)) < (Er Go) — yo)* 


holds almost surely for any production strategy £y € xje; Z. Using our model 
assumption (3) we deduce Er (£o) — Er (Eğ) = Eo(&o) — Eo(&g) which gives 


(Er (59) — yo)* + Lier (&*)-yo>0) (Eo(&0) — Eo(59)) < (Er (Éo) — yo)*. 


Calculating the expectations with respect to Q* on both sides and multiplying both 
sides by zr, we obtain 


E 


2 (Ger 3) — yo)*)  xE" (Musso) Eoo) — Eo&) < 
< nE% ((Er(&)) — y0)*). 


Using the martingale property and the digital terminal value of the equilibrium 
allowance prices shown in Proposition 2, we finally obtain 


n EV" ((Er(&) — yo)*) + Aj (Eo (£o) — Eo(&)) < tE® ((Er (Eo) — yo)*) . 
Q3) 
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For the case that the strategy &o yields at least the total production volume of the 


equilibrium strategy, Vo(&o) => Vo (&y ), assertion (11) in Proposition 1 yields the 
estimate 


Co(&9) + A5 Eo (EĞ) < Co(&o) + Aj Eo(&o). 


which is equivalent to 


Co(&5) — Co(&o) < Ag(Eo(&o) — Eo(&9. 


Now, combining the last inequality with (23), we obtain 


CoE) + tE™ ((Er Eğ) — yo)*) < Co(&) + EV ((Er (éo) — yo)*). 


which proves our claim (22). 


In Proposition 3, the equilibrium production schedule £j was characterized as a 
solution to the minimization problem 


min(EO (B(£)) : £o € xiejZ',  Vo(&) = Do}. (24) 


Although this fact is about minimization of social burden, it should not be interpreted 
as one of the classical welfare results, which typically follow from equilibrium con- 
siderations. 

An interesting point here is that this type of cost-optimality needs to be taken with 
great care: due to the opportunity cost-pass-through, the consumers can not expect 
that an (inappropriately designed) cap-and-trade mechanism indeed implements the 
cheapest way of emission reduction, from their perspective. 

To see this point, remember that the price per unit of electricity under the merit 
order system includes the opportunity costs of consuming the emission certificates. 
Therefore, given emission price Ao and an overall production schedule £o, the con- 
sumers pay the costs by? 1 AoE HE) to switch in the merit order and to reduce 
emissions. From a global perspective, this costs stands for a wealth re-distribution. 
From the consumer's perspective, it is associated with a burden. 


6 Equilibrium-Like Risk-Neutral Modeling 


Another interesting observation from Proposition 3 is that the expectation 
nO" (B(&)) of the social burden B(&o) is minimized with respect to a risk-neutral 
measure Q* which differs from the objective measure P. The measure Q* is an 
outcome of the equilibrium, and, as such, it heavily depends on the many model 
components, for instance on the risk-aversions, on the certificate endowments, and 
on the production technologies of the agents. However, it is surprising that, once the 
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measure Q* is known, other important equilibrium outcomes can be deduced from 
aggregated quantities only. 

In particular, given Q*, the equilibrium production schedule &j can be obtained as 
the solution of optimization problem (24). Such solution is determined by aggregated 
quantities, since the social burden is by definition B(£o) = Colo) + (Er (£o) — 
yo)* and, apart the quantities yo and zr decided by the authority, it depends only 
on technologies present in the market. Having obtained the equilibrium production 
schedule &j as the solution of optimization problem (24), the equilibrium allowance 
price Aj is calculated applying martingale pricing: 


Aj = nE% (15 9). 


Finally, given the production schedule £j and the allowance price Ağ, also the elec- 
tricity price P% is determined as the marginal price of the most expensive technology, 
which is active in the schedule &}. Note that the opportunity costs must be included 
when identifying the most expensive active technology. 

Summarizing, we conclude that given Q*, merely aggregated market parameters 
are needed to obtain Ej An and PŠ- This observation can be used to establish and to 
analyze realistic equilibrium-like emission market models. Such models are needed, 
since in real emission trading it is nearly impossible to estimate the equilibrium from 
a market model, because the individual parameters are highly undetermined. For 
instance, within the EU ETS, there are more than 25, 000 agents, each with a specific 
production, its own certificate endowment, and a completely unknown risk-aversion. 
On the contrary, the aggregated quantities are well-known, since high-quality market 
data on total allowance allocation and electricity production, including capacities, 
costs, and emission rates, are available. 

In view of this, we suggest an alternative way to estimate the market equilib- 
rium based on aggregated quantities and using an exogenously specified proxy for 
risk-neutral measure Q*. This general approach follows the standard methodology 
of financial mathematics, which successfully describes the stochastic evolution of 
equilibrium prices on financial markets under an appropriately chosen risk-neutral 
measure. 


6.1 Market Equilibrium Under a Risk-Neutral Measure 


We sketch the following program for equilibrium-type modeling of emission markets: 
(1) Determine a risk-neutral measure Q*, which corresponds to an equilibrium situ- 
ation of the emission market in the sense of (i) of Proposition 2. 

(2) Observe that, because of Proposition 3, the equilibrium production schedule £j 
must be a solution to the deterministic optimization problem 
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minimize Co (Go) + mE (Er (€o) — y9)*) e 
subject to Vo(&o) = Do, over éo € x;e1E'. 

To address the problem further, a specification of the space x;e; Æ} of market pro- 

duction strategies along with the functions Co, Vo and Er is required. 

(3) Given the equilibrium production schedule, calculate the total production costs 

Co(&9), the total carbon dioxide emission Er (55), and the energy price P* to assess 

the performance in emission reduction of the current market architecture. 


Remark Note that a risk neutral measure is not unique. Clearly, finding a realistic 
candidate for the risk-neutral measure Q* can be difficult. However, notice that one 
merely needs to specify the fluctuations of the non-predictable emissions under a 
risk-neutral measure. This distribution can be described in a parameter-dependent 
way, which adds flexibility to the model. For instance, having assumed a Gaussian 
framework under objective measure and modeling the density of the risk-neutral mea- 
sure in terms of a Girsanov kernel. Given theoretical initial emission price depending 
on the parameters of the Girsanov kernel, these parameters shall be adjusted to match 
the observed emission prices. Similar techniques have been applied in financial mod- 
eling under the framework of implicit model calibration. Being one of the central 
questions in quantitative finance, the connection between risk-neutral and objective 
measures has been successfully addressed over the recent decades. In view of this 
development, modeling from a risk-neutral measure perspective can be based on a 
variety of different methods, ranging from benchmark approach, estimation of risk 
premia, state price density from portfolio optimization theory, to several econometric 
methods for the estimation of the so-called market price of risk. 


Finally, the performance ofthe cap-and-trade mechanism can be examined leveraging 
on the dependence of the major economic indicators, i.e. total consumers' costs 
Pj Do, total (producers') production costs Co (E. and total carbon dioxide emission 
Eo (gm. on the controls available to the regulator. 

Note that in the standard scheme the regulator controls two key parameters: the 
total allowance allocation yo and the penalty size x. The performance of regulation 
could be assessed in terms of relation between the increase of consumers' costs versus 
the achieved emission reduction. Such analysis may uncover and visualize inappro- 
priate market architectures, where unlucky choices of yo and m cause consumers 
to pay too much, compared to emission savings. Complementary or supplementary 
policies can be evaluated at this point next to the cap-and-trade system. In particular, 
different forms of subsidies and carbon tax mechanisms can have a strong impact on 
the merit order of different technologies. 


7 Conclusions 


Inthis paper, we show how equilibrium analysis and optimization of an environmental 
market can be carried out under the realistic assumption of risk-averse market players. 
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This generalization is based on a novel approach. Thereby, we obtain a number of 
interesting observations, which allow studying equilibrium market situations in terms 
of aggregated market quantities under a risk-neutral measure. Our findings show 
how market design optimization can be achieved incorporating risk-aversion. The 
choice to develop our approach in one-period setting, yields explicit results which 
constructively contribute to better understand the working principles of financial 
instruments and to improve both effectiveness and efficiency of environmental policy. 


Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 
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Exponential Ergodicity 
of the Jump-Diffusion CIR Process 


Peng Jin, Barbara Rüdiger and Chiraz Trabelsi 


Abstract In this paper we study the jump-diffusion CIR process (shorted as JCIR), 
which is an extension of the classical CIR model. The jumps of the JCIR are intro- 
duced with the help of a pure-jump Lévy process (J;, t > 0). Under some suitable 
conditions on the Lévy measure of (J;, t > 0), we derive a lower bound for the tran- 
sition densities of the JCIR process. We also find some sufficient conditions under 
which the function V(x) = x, x > 0,is a Forster-Lyapunov function for the JCIR 
process. This allows us to prove that the JCIR process is exponentially ergodic. 


Keywords CIR model with jumps - Exponential ergodicity * Forster-Lyapunov 
functions * Stochastic differential equations 


MSC: 60H10 - 60J60 


1 Introduction 


The Cox-Ingersoll-Ross model (or CIR model) was introduced in [1] by Cox et al. 
in order to describe the random evolution of interest rates. The CIR model captures 
many features of the real world interest rates. In particular, the interest rate in the 
CIR model is non-negative and mean-reverting. Because of its vast applications in 
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mathematical finance, some extensions of the CIR model have been introduced and 
studied, see e.g. [2, 5, 15]. 

In this paper we study an extension of the CIR model including jumps, the so- 
called jump-diffusion CIR process (shorted as JCIR). The JCIR process is defined as 
the unique strong solution X :— (X+, t > 0) to the following stochastic differential 
equation 

dX, =a(0 — X)dt - o / X,dW; --dJ, Xo 7 0, (1) 


where a, 0 > 0,0 > O are constants, (W;, t > 0) is a 1-dimensional Brownian 
motion and (J;, t > 0) is a pure-jump Lévy process with its Lévy measure v con- 
centrated on (0, oo) and satisfying 


| (E A D)v(d&) < oo, (2) 
(0,00) 


independent of the Brownian motion (W;, t > 0). The initial value Xo is assumed 
to be independent of (W;,t > 0) and (J;, t > 0). We assume that all the above 
processes are defined on some filtered probability space (42, F, (F):>0, P). We 
remark that the existence and uniqueness of strong solutions to (1) are guaranteed 
by [7, Theorem 5.1]. 

The term a(0 — X;) in (1) defines a mean reverting drift pulling the process 
towards its long-term value 0 with a speed of adjustment equal to a. Since the 
diffusion coefficient in the SDE (1) is degenerate at 0 and only positive jumps are 
allowed, the JCIR process (X;, t > 0) stays non-negative if Xo > 0. This fact can be 
shown rigorously with the help of comparison theorems for SDEs, for more details 
we refer the readers to [7]. 

The JCIR defined in (1) includes the basic affine jump-diffusion (or BAJD) as a 
special case, in which the Lévy process (J;, t > 0) takes the form of a compound 
Poisson process with exponentially distributed jumps. The BAJD was introduced by 
Duffie and Gárleanu [2] to describe the dynamics of default intensity. It was also 
used in [5, 12] as a short-rate model. Motivated by some applications in finance, the 
long-time behavior of the BAJD has been well studied. According to [12, Theorem 
3.16] and [10, Proposition 3.1], the BAJD possesses a unique invariant probability 
measure, whose distributional properties were later investigated in [9, 11]. Weremark 
that the results in [10, 11] are very general and hold for a large class of affine process 
with state space R4, where R, denotes the set of all non-negative real numbers. The 
existence and some approximations of the transition densities of the BAJD can be 
found in [6]. A closed formula of the transition densities of the BAJD was recently 
derived in [9]. 

In this paper we are interested in two problems concerning the JCIR defined in (1). 
The first one is to study the transition density estimates of the JCIR. Our first main 
result of this paper is a lower bound on the transition densities of the JCIR. Our idea 
to establish the lower bound of the transition densities is as follows. Like the BAJD, 
the JCIR is also an affine processes in R+. Roughly speaking, affine processes are 
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Markov processes for which the logarithm of the characteristic function of the process 
is affine with respect to the initial state. Affine processes on the canonical state space 
R? x R” have been investigated in [3, 5, 13, 14]. Based on the exponential-affine 
structure of the JCIR, we are able to compute its characteristic function explicitly. 
Moreover, this enables us to represent the distribution of the JCIR as the convolution 
oftwo distributions. The first distribution is known and coincides with the distribution 
of the CIR model. However, the second distribution is more complicated. We will 
give a sufficient condition such that the second distribution is singular at the point 0. 
In this way we derive a lower bound estimate of the transition densities of the JCIR. 

The other problem we consider in this paper is the exponential ergodicity of 
the JCIR. According to the main results of [10] (see also [12]), the JCIR has a 
unique invariant probability measure zr, given that some integrability condition on 
the Lévy measure of (J;, t > 0) is satisfied. Under some sharper assumptions we 
show in this paper that the convergence of the law of the JCIR process to its invariant 
probability measure under the total variation norm is exponentially fast, which is 
called the exponential ergodicity. Our method is the same as in [9], namely we show 
the existence of a Forster-Lyapunov function and then apply the general framework 
of [16-18] to get the exponential ergodicity. 

The remainder of this paper is organized as follows. In Sect.2 we collect some 
key facts on the JCIR and in particular derive its characteristic function. In Sect. 3 we 
study the characteristic function of the JCIR and prove a lower bound of its transition 
densities. In Sect. 4 we show the existence of a Forster-Lyapunov function and the 
exponential ergodicity for the JCIR. 


2 Preliminaries 


In this section we use the exponential-affine structure of the JCIR process to derive 
its characteristic functions. 

We recall that the JCIR process (X;, t > 0) is defined to be the solution to (1) and 
it depends obviously on its initial value Xo. From now on we denote by (X7, t > 0) 
the JCIR process started from an initial point x > 0, namely 


dX? —a(0 — X*)dt Fo / XIdW, +dh, X% — x. (3) 


Since the JCIR process is an affine process, the corresponding characteristic func- 
tions of (X7, t > 0) is of exponential-affine form: 


Efe] = bOO, £e dr = (ue C: Nu x 0j, (4) 
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where Ru denotes the real part of u and the functions ¢ (t, u) and v (t, u) in turn are 
given as solutions of the generalized Riccati equations 


9,0 (t, u) = F (Y(t, u)), $ (0, u) = 0, (5) 
dwt, u) = R(W(t,u)), w (0, u)-uc^, 
with the functions F and R given by 
F(u) = au «f (e! — 1)v(dé), 
(0,00) 
2:2 
R(u) = = au. 
Solving the system (5) gives $ (t, u) and y(t, u) in their explicit forms: 
ue 
y (tu) = 2 (6) 


1— 5zu(l — e7“) 


and 


2 


2a0 o ET d su 
b(t, u) =- log (17 5-u e »«f f, (re )vaas (7) 


Here the complex-valued logarithmic function log(-) is understood to be its main 
branch defined on C \ {0}. For t > 0 and u € YW we define 


—at 


o? — 2a0 xue 
gi(t,u, x) := (1 — —u(0 — e77)) 2? exp ( ; ) 
2a 1 — $zu(1— e-4!) 


qa (L, u) := exp qi l (eu = 1) v(d§)ds), (8) 


where the complex-valued power function z~74°/ c? exp ( — (2a0/c?)log z) is 
also understood to be its main branch defined on C^ {0}. One can notice that g2(t, u) 
resembles the characteristic function of a compound Poisson distribution. 

It follows from (4), (6) and (7) that the characteristic functions of (X7, t > 0) is 
given by 


Ele"X']=gi(t,u,x)gr(t,u), ued, (9) 


where qi (f, u, x) and ¢2(t, u) are defined by (8). 
According to the parameters of the JCIR process we look at two special cases: 
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2.1 Special Case (i): v = 0, No Jumps 


Notice that the case v = 0 corresponds to the classical CIR model (Y;,t > 0) 
satisfying the following stochastic differential equation 


dY? =a(0 — Y)dt - o /Y?dW,, Yi —x20. (10) 


It follows from (9) that the characteristic function of (Y?, t > 0) coincides with 
gi(t,u, x), namely for u € Y 


Efe"*"] = e(t, u, x). (11) 


It is well known that the classical CIR model (Y?,: > 0) has transition density 
functions f(t, x, y) given by 


f(t, x,y) = ce (T) (2(uv)?) (12) 


fort > 0,x > 0 and y > 0, where 


2a 
Km , wake "5, 
o?(1 — e—a) 
2a0 
V=EkKy, = l, 
o 


and /,(-) is the modified Bessel function of the first kind of order q. For x = 0 the 
formula of the density function f (f, x, y) is given by 


— ye” 
f(t,0,y) = T4 D' e (13) 


for t > O and y > 0. 


2.2 Special Case (ii): 0 = 0 and x = 0 
We denote by (Z;, t > 0) the JCIR process given by 
dZ, = —aZıdt - o ZidW, cd, Zo =0. (14) 
In this particular case the characteristic functions of (Z;, t > 0) is equal to g2(t, u), 


namely for u € Y 
E[e"^] = g(t, u). (15) 
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3 A Lower Bound for the Transition Densities of JCIR 


In this section we will find some conditions on the Lévy measure v of (J;, t > 0) 
such that an explicit lower bound for the transition densities of the JCIR process 
given in (3) can be derived. As a first step we show that the law of X7, t > 0, in (3) 
is absolutely continuous with respect to the Lebesgue measure and thus possesses a 
density function. 


Lemma 1 Consider the JCIR process (X? ,t > 0) (started from x > 0) that is 


defined in (3). Then for any t > 0 and x > 0 the law of X; is absolutely contin- 
uous with respect to the Lebesgue measure and thus possesses a density function 
pt, x, y), y z 0. 


Proof As shown in the previous section, it holds 
E[e"*] = qi (t, w x)ga(t, u) = Ele" ]E[e"7], 


therefore the law of X7, denoted by u x», is the convolution of the laws of Y? and Z;. 
Since (Y;*, t > 0) is the well-known CIR process and has transition density functions 
f(t,x,y),t > 0,x, y > 0 with respect to the Lebesgue measure, thus Lx; is also 
absolutely continuous with respect to the Lebesgue measure and possesses a density 
function. 


In order to get a lower bound for the transition densities of the JCIR process we 
need the following lemma. 


Lemma 2 Suppose that Jon EIn(1/é)v(d&) < oo. Then q» defined by (8) is the 
characteristic function of a compound Poisson distribution. In particular, P(Z; = 
0) > O for all t > 0, where (Z;,t > 0) is defined by (14). 


Proof It follows from (6), (8) and (15) that 


t —as 
E[e"^] = galt, u) = exp (/ J, ooj (exp (eaaa) 1)». 


where u € WY. Define 


t E —as 
Ai | Jo hoa) vitas. 


If we rewrite 


£e-es 
TE G = CUP EE 2257) ME cu (Is) 
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where 
u 2aé 
a i= o2(e% — 1) > 0, 
2ae“* (17) 
= ——————— -0, 
PIG» 


then we recognize that the right-hand side of (16) is the characteristic function of a 
Bessel distribution with parameters o and £. Recall that a probability measure Ha,g 
on (Ry, B(R,)) is called a Bessel distribution with parameters o and £ if 


Ua, Bp(dx) = e *óg(dx) + Be *-£* Fee (2,/aBx)dx, (18) 


where do is the Dirac measure at the origin and 7, is the modified Bessel function of 
the first kind, namely 
oo ( Ji r?) k 


hoj= PAE reR. 


For more properties of Bessel distributions we refer the readers to [8, Sect. 3] (see 
also [4, p. 438] and [9, Sect. 3]). Let fig, g denote the characteristic function of the 
Bessel distribution 4a, g with parameters a and f which are defined in (17). It follows 
from [9, Lemma 3.1] that 


» Qu Ee“ u 
fie, p (u) = exp Gea —— ) = exp (— Giada a) 
Therefore 
A= n | TO — 1)v(d£)ds 
0 J(0,00) 
t Qu 
= f | (e pp pe 1)viatds. 
0 J(0,00) 
Set 


X J ol (1 — e7*)v(d£)ds 
(0,00) 
aff ( Lae vec DC (19) 
Ww. 


292 P. Jin et al. 


If A < co, then 


t Qu 
As Fa 67 )v(d£)ds — 2 
EN (e — e) u(deyas 
] ff go gy 
-(f = (eF — e7*)v(d£)ds — 1). 


The fact that à < oo will be shown later in this proof. 

Next we show that the term (1/4) i Jo.) (exp (au /(B—u)) —exp(—a@)) v(d£)ds 
can be viewed as the characteristic function of a probability measure p. To define p, 
we first construct the following measures 


ma p(dx) := Be-*-P* Fr Qafx)dx, x> 0, 


where J; is the modified Bessel function of the first kind. Noticing that the measure 
Mq,p is the absolute continuous component of the measure [g,g in (18), we easily 
get 


ou 


= eh-u — e 


a —a 


, 


fio g(u) = [PT =g 


OO ux 


where Mia, g (u) := Jo e" m, g(dx) for u € Y. Recall that the parameters œ and f 
defined by (17) depend on the variables & and s. We can define a measure p on R+ 


as follows: : 
1 
pium JI | mo,g v(d&)ds. 
^ Jo J(0,o0) 


By the definition of the constant A in (19) we get 


1 t 
: n | ma p (Ra )v(d£)ds 
0 J0,%) 


t 
JU (1— e “)v(dé)ds 
^ Jo J(0,o0) 
1, 


i.e. p is a probability measure on R, and for u € Y 


p (R4) 


ne | e"! p(dx) 
(0,00) 


1 fi ETE 
Ji | (e?-* — e-*)v(d&)ds. 
à Jo J (0,0) 


Thus A = A(f(u) — 1) and E[e"^] = e? 00—UD is the characteristic function of a 
compound Poisson distribution. 
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Now we verify that à < oo. Noticing that 


t 
a= f (1 — e *)v(d&)ds 
(0,00) 
2a& 

-f J ( Loe es re ) v(dé)ds 
(0, m 

=| nn (i-e ze ds (db), 
(0,00) JO 


2 
we introduce the change of variables MEC NA :— y and then get 
o? (ess — 1) 
2a& s 
dy = Ec ds 


= ie x s n 1)as. 


Therefore 
2a£ 
giat ij =y —2& 
A= v(d if (1— e) dy 
I, oo) i 2a&£y + o?y? 
2E 
=| wb T e) dy 
h oo) a 2aéy + o?y? 
=f vas) f gni dy 
(0,00) a 2a£y + a^y 
2(,5at 
— 1 
where ô := PACCO Define 
2a 
2E 


MG) =f U- ea nd 


Then M (&) is continuous on (0, co). As € > 0 we get 


dy 
2a£y + o?y? 


1 oo 
M(é) = dt ro dy +2 / I-e) 


2 
2a&£y --o2y? 


M =l j 
2 DATT a z f ‘nal 27 


2 M M 2 
<24 lart f o 
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2 1 
2E —dy = [ma + 0y)| 


2 
= 2 InQaé + o2) — = In(2aé + a sy 
o2 o2 ó 


^ 


1 1 
< cig + c£ In(—) < c3§ lno) 
H E 
for sufficiently small £, we conclude that 
1 
M(E) < caé Ince), as § > 0. 


If € — co, then 


M) < [o g?) = dy 
Et 2a£y + o?y? 


EL 2 d <a f 1 j 
~ J£ 2a£yco?y? dis ory? y 


$ 


EUM b oral an 


5 


Therefore, 


1 oo 
he cs f E MEWAH $ cs f 1v(d£) < oo. 
0 1 


With the help of the Lemma 2 we can easily prove the following proposition. 


Proposition 1 Let p(t, x, y), t > 0,x, y > 0 denote the transition density of the 
JCIR process (X7,t > 0) defined in (3). Suppose that So. E In(2)v(d£) « oo. 
Then for all t > 0, x, y > 0 we have | 


p(t, x, y) = P(Z; = 0) f (t, x, y) 


where P(Z, = 0) > Oforallt > O and f (t, x, y) are transition densities of the CIR 
process (without jumps). 


Proof According to Lemma 2, we have P(Z; = 0) > 0. Since 


Ele"X'] = gı (t, u, x)gr(t, u) = Ele" ] E[e"7], 
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the law of X7, denoted by u x? is the convolution of the laws of Y? and Z;. Thus 
for all A € A(R+) 


Ax; (A) al Ay; (A — y)uz, (dy) 
R+ 


> f urA -uz dy) 
> uy (A)uz (0) 
> P(Z, = Oy: (A) 


> PU, =0) | ftx vid, 


where f(t, x, y) are the transition densities of the classical CIR process given in 
(12). Since A € A(R+) is arbitrary, we get 


p(t, x, y) 2 P(Zi = 0) f (t, x, y) 


for all t > 0, x, y > 0. 


4 Exponential Ergodicity of JCIR 


In this section we find some sufficient conditions such that the JCIR process is 
exponentially ergodic. We have derived a lower bound for the transition densities of 
the JCIR process in the previous section. Next we show that the function V (x) = x, 
x > 0, is a Forster-Lyapunov function for the JCIR process. 


Lemma 3 Suppose that Ja Ev(dé) < oo. Then the function V(x) = x, x > 0, 
is a Forster-Lyapunov function for the JCIR process defined in (3), in the sense that 
for all t > 0, x > 0, 

E[V(X7))] x e " V(x) + M, 


where 0 « M « oo isa constant. 


Proof We know that wy: = uyy * uz, therefore 
E[X7] = E[Y7] + E[Z;]. 


Since (Y;*,t > 0) is the CIR process starting from x, it is known that Ay; isa 
non-central Chi-squared distribution and thus E[Y?] < oo. Next we show that 
E[Z;] < œ. 

Let u € (—oo, 0). By using Fatou's Lemma we get 
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E[Z; 


-— 


eh] 
= E| lim | 
u—0 u 


"n zn 
< liminf E| | = liminf 
u—>0 u 


Recall that 


E[e"^] = mt, u) = exp ( UL 
bd 


Then we have for all u < 0 
gue“ 


à 
s (oo — (e?/2a)1 a 1) 


Ee“ 


£ue 45 


el- 1-(62/2a)(1—e-45)u € 05) 


Eue 
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1) v(dé)ds) = eA 


—as 


| (1 @?/2ay(1 - eap s G - (0? /2a)(1 — mayu) 


Ee“ 


(1 — (e2/2a)(1 — e-«5)u)? 


IA 


< £e 5 


and further 


t 
E Ee “v(d&)ds < oo. 
0 J7(0,00) 


Thus A(u) is differentiable in u and 


t 
vo -[ f 
0 J (0,06 


It follows that 


—as En dud 
£e “v(d&)ds = = | Ev(dé). 
a (0,00) 


t,u) — t,0 
ELZ,] < liming ZEL — 920.0 
u—0 u 
= daa = 24 4'0) 
u=0 

l—e 


—at 
pee | ev(dé). 
a (0,00) 


Therefore under the assumption f Ev(dé) < oo we have proved that E[Z;] < 
(0,00) 


oo. Furthermore, 


1—e 


ELZ] = + (Ele) 


—at 
= =] Ev(dé). 
u=0 a (0,00) 
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On the other hand, 


—at 


Eje" | = (1 — (o? 2a)u( — e) exp (; = o - 73) 


With a similar argument as above we get 


—0(1—e 9-4 xe. 


u= 


E[Y?] - 2 (Ele”*}) 


Altogether we get 


E[X7] = EIY7] + E[Z.] 


t 
—at 


1-— 
=(1- e *')(0 + E c axe 7 
a 
1 
<O0+—-4+xe™%, 
a 


namely 


1 
E[V(X*)] 0 + A e" V (x). 


Remark 1 If f (1,00) Ev(dé) < oo, then there exists a unique invariant probability 
measure for the JCIR process. This fact follows from [12, Theorem 3.16] and [10, 
Proposition 3.1]. 


Let || - ||ry denote the total-variation norm for signed measures on R+, namely 


lalry = sup {|u(A)]}. 
AcB(R,) 


Let P' (x,-) := P(X} € -) be the distribution of the JCIR process at time ¢ started 
from the initial point x > 0. Now we prove the main result of this paper. 


Theorem 1 Assume that 
1 
f & v(d£) « oo and é In(-)v(d&) < oo. 
(1,00) (0,1) E 


Let x be the unique invariant probability measure for the JCIR process. Then the 
JCIR process is exponentially ergodic, namely there exist constants 0 < B < 1 and 
0 « B « co such that 


IP^G, ) -zlirv € B(x -1)8, £20, xe Ry. (20) 


Proof Basically, we follow the proof of [18, Theorem 6.1]. For any 5 > 0 we 
consider the 5-skeleton chain n} :— X7,,. n € Z,, where Z4 denotes the set of all 
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non-negative integers. Then (7; )nez, is a Markov chain on the state space R+} with 
transition kernel P? (x, -) and starting point ng = x. It is easy to see that the measure 
7 is also an invariant probability measure for the chain (77 )nez,.,x = 0. 

Let V(x) = x, x > 0. It follows from the Markov property and Lemma 3 that 


EU Gj Dij. nt. nl f VOP? M, dy) < eV (ns) + M, 
R+ 


where M is a positive constant. If we set Vo := V and V, := V (nž), n € N, then 
E[Vi] < e Vo(x) + M 


and 
E[V,ilij nt,- n] < eV, +M, neN. 


Now we proceed to show that the chain (7; )nez, , x = 0, is A-irreducible, strong 
aperiodic, and all compact subsets of R., are petite for the chain (7 )nez.,. 

“}-irreducibility”: We show that the Lebesgue measure à on R is an irreducibil- 
ity measure for (7) )nez,. Let A € (R+) and A(A) > 0. Then it follows from 
Proposition 1 that 


Pin} € An] = x] = PE € A) > P(Zs = of Miara, 
A 


since f (8, x, y) > O for any x € R4 and y > 0. This shows that the chain (75 )nez, 
is irreducible with A being an irreducibility measure. 

“Strong aperiodicity" (see [16, p. 561] for a definition): To show the strong aperi- 
odicity of (15 )neZo, we need to find a set B € (R+), a probability measure m with 
m(B) = 1, and € > O such that 


L(x,B)»0, xeR4, (21) 


and 
P(nj € A) > em(A), x EC, Ae Z(R,), (22) 


where L(x, B) := P(n} € B for some n € N). To this end set B := [0, 1] and 
g(y) := infxejo,1] f (8, x, y), y > 0. Since for fixed y > 0 the function f (6, x, y) 
is strictly positive and continuous in x € [0, 1], thus we have g(y) > 0 and 0 < 
Jon g(y)dy < 1. Define 


1 
A = — " | a » r | 
m(A) So. £00dy faas y EZR) 
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Then for any x € [0, 1] and A € Z(R.) we get 


P(n € A) = P(X; € A) 


> P(Z5 = 0 | f (8, x, y)dy 
A 
> P(Z; = D f g(y)dy 
AN(0, 1] 
> P(Z3 = om) | a(y)dy, 
0,1] 


so (22) holds with £ := P(Zs = 0) fig 1, 80)dy. 
Obviously 


L(x, [0, 1]) = PGrj € [0, 1) = P(X; € [0, 1) = P(Z; = 0 | f, x, y)dy > 0 
[0,1] 


for all x € R4, which verifies (21). 

“Compact subsets are petite": We have shown that A is an irreducibility measure 
for (77, )nez,- According to [16, Theorem 3.4(ii)], to show that all compact sets are 
petite, it suffices to prove the Feller property of (7) )nez,, x = 0. But this follows 
from the fact that (77 ),ez , is a skeleton chain of the JCIR process, which is an affine 
process and possess the Feller property. 

According to [16, Theorem 6.3] (see also the proof of [16, Theorem 6.1]), the 
probability measure z is the only invariant probability measure of the chain (7 sez , . 
x > 0, and there exist constants 6 € (0, 1) and C € (0, co) such that 


IP” (x, — zlrv < C(x + 1)8", n € Ly, x € R}. 


Then for the rest of the proof we can proceed as in [18, p. 536] and get the 
inequality (20). 
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Optimal Control of Predictive Mean-Field 
Equations and Applications to Finance 
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Abstract We study a coupled system of controlled stochastic differential equations 
(SDEs) driven by a Brownian motion and a compensated Poisson random measure, 
consisting of a forward SDE in the unknown process X (t) and a predictive mean-field 
backward SDE (BSDE) in the unknowns Y (t), Z(t), K (t, -). The driver of the BSDE 
at time may depend not just upon the unknown processes Y(t), Z(t), K (t, -), but 
also on the predicted future value Y (t + ô), defined by the conditional expectation 
A(t) :— E[Y (t 4-6)|.Z, ]. We give a sufficient and a necessary maximum principle for 
the optimal control of such systems, and then we apply these results to the following 
two problems: (i) Optimal portfolio in a financial market with an insider influenced 
asset price process. (ii) Optimal consumption rate from a cash flow modeled as a 
geometric Itó-Lévy SDE, with respect to predictive recursive utility. 
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1 Introduction 


The purpose of this paper is to introduce and study a pricing model where beliefs 
about the future development of the price process influence its current dynamics. We 
think this can be a realistic assumption in price dynamics where human psychology 
is involved, for example in electricity prices, oil prices and energy markets in general. 
It can also be a natural model of the risky asset price in an insider influenced market. 
See Sect. 5.1. 

We model such price processes as backward stochastic differential equations 
(BSDEs) driven by Brownian motion and a compensated Poisson random measure, 
where the coefficients depend not only of the current values of the unknown processes, 
but also on their predicted future values. These predicted values are expressed mathe- 
matically in terms of conditional expectation, and we therefore name such equations 
predictive mean-field equations. 'To the best of our knowledge such systems have 
never been studied before. 

In applications to portfolio optimization in a financial market where the price 
process is modeled by a predictive mean-field equation, we are led to consider coupled 
systems of forward-backward stochastic differential equations (FBSEDs), where the 
BSDE is of predictive mean-field type. In this paper we study solution methods for 
the optimal control of such systems in terms of maximum principles. Then we apply 
these methods to study 


(i) optimal portfolio in a financial market with an insider influenced asset price 
process. (Sect. 5.1), and 

(ii) optimal consumption rate from a cash flow modeled as a geometric Itó-Lévy 
SDE, with respect to predictive recursive utility (Sect. 5.2). 


2 Formulation of the Problem 


We now present our model in detail. We refer to [5] for information about stochastic 
control of jump diffusions. 

Let B(t) = B(t, w); (t, œ) € [0, 00) x 2 and N (dt, dt) = N (dt, d£) —v(dt)dt 
be a Brownian motion and an independent compensated Poisson random measure, 
respectively, on a filtered probability space (Q2, 2, F 7o. P) satisfying the 
usual conditions. We consider a controlled system of predictive (time-advanced) 
coupled mean-field forward-backward stochastic differential equations (FBSDEs) 
of the form (T > 0 and ô > 0 are given constants) 


e Forward SDE in X (t): 


dX(t) =dX" (t) = b(t, X(t), Y(t), A), Zi), K (t, -), u(t), o)dt 

+o (t, X(t), Y(t), A(t), Z(t), K (t, -), u(t), w)d Bt) 

+ fg v. X), Y (0), AQ), ZE), K, +), u(t), ¢, o) N (dt, d£) ; t € [0, T] 
X(0 =xeER 


Optimal Control of Predictive Mean-Field Equations ... 303 
e Predictive BSDE in Y(t), Z(t), K (t): 


dY(t) = —g(t, X(t), Y(t), A(t), Z(t), K (t, ), u(t), w)dt + Z(t)d B(t) 
+ fie K(t, ¢)N(dt, dz) ; t € [0, T) 
Y(T) =A(X(T),o). 


(1) 
We set 


Y() :2L; te(T, T +ô], (2) 
where L is a given bounded F -measurable random variable, representing a *ceme- 
tery” state of the process Y after time T . The process A(t) represents our predictive 
mean-field term. It is defined by 

A(t) := E[Y (t 8) | A]; t € [0, T]. (3) 
Here Z is the set of functions from Ro :— IR M0] into R, A(x, c) is a C! function 


(with respect to x) from R x €2 into R such that h(x, -) is Zr-measurable for all x, 
and 


g:[O, T] x Rx Rx RxRxZxUx2mR 


is a given function (driver) such that g(t, x, y, a, z, k, u, -) is an F-adapted process 
forall x, y, a, z € R, k e Z andu € U, whichis the set of admissible control values. 
The process u(t) is our control process, assumed to be in a given family YW = g 
of admissible processes, assumed to be cadlag and adapted to a given subfiltration 
G = {G}:>0 of the filtration F, i.e. 4, C F, forall t. The sigma-algebra Y represents 
the information available to the controller at time f. 

We assume that for all u € «7 the coupled system (1)-(3) has a unique solution 
X(t) = X"(t) € L?(m x P), Y(t) = Y"(t) e L(m x P), A(t) = A"(t) € 
L?(m x P), Z(t) = Z"(t) € L?(m x P), K(t, 6) = K" (t,t) e L(m x v x P), 
with X(t), Y (t), A(t) being càdlàg and Z(t), K (t, £) being predictable. Here and 
later m denotes Lebesgue measure on [0, T]. 

To the best of our knowledge this system, (1)-(3), of predictive mean-field FBS- 
DE; has not been studied before. However, the predictive BSDE (1)-(3) is related to 
the time-advanced BSDE which appears as an adjoint equation for stochastic con- 
trol problems of a stochastic differential delay equation. See [7] and the references 
therein. 

The process A(t) models the predicted future value of the state Y at time t + ô. 
Therefore (1)-(3) represent a system where the dynamics of the state is influenced by 
beliefs about the future. This is a natural model for situations where human behavior 
is involved, for example in pricing issues in financial or energy markets. 
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The performance functional associated to u € & is defined by 


T 
JU)=E p f(t, X(t), Y(t), AG), u(t), w)dt + e(X(T), w) + vro] 
(4) 


where f : [0,7] x IgRSRxUx 2—R,o:Rx2—Randvy:RmR 
are given C! functions, with f(t, x, y, a, u, -) being F-adapted for all x, y, a € R, 
u € U. We assume that g(x, -) is .Zr-measurable for all x. 

We study the following predictive mean-field stochastic control problem: 
Find u* € & such that 


sup J(u) = J (u*). (5) 
RZA 


In Sect.3 we give a sufficient and a necessary maximum principle for the optimal 
control of forward-backward predictive mean-field systems of the type above. 

An existence and uniqueness result for predictive mean-field BSDEs is given in 
Sect. 4. 
Then in Sect. 5 we apply the results to the following problems: 


e Portfolio optimization in a market where the stock price is modeled by a predictive 
mean-field BSDE, 
e Optimization of consumption with respect to predictive recursive utility. 


3 Solution Methods for the Stochastic Control Problem 


3.1 A Sufficient Maximum Principle 


For notational simplicity we suppress the dependence of win f, g, h, g and w in the 
sequel. We first give sufficient conditions for optimality of the control u by modifying 


the stochastic maximum principle given in, for example, [6], to our new situation: 
We define the Hamiltonian H : [0, T|xRxRxRxRx#xUxRxRxRxR) > 
R associated to the problem (5) by 


H(t,x,y,a,z ku, p,q,r,à) = f(t, x, y,a,u) + b(t, x, y,a,z,k,u)p+ot,x, y,a,z,k,u)q 


«f y (t, x, y. a, z, k, u, C) N (dt, do) + g(t, x, y, a, z, k, u)À. 
R 
(6) 


We assume that f, b, c, y and g, and hence H, are Fréchet differentiable (C 1) in 
the variables x, y, a, z, k, u and that the Fréchet derivative VH of H with respect 
tok € & as a random measure is absolutely continuous with respect to v, with 
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H 
. Thus, if (V4 H, h) denotes the action of the linear 
v 
operator V; H on the function h € Z we have 


; . . dk 
Radon-Nikodym derivative 


dV. H(£) 
(VH, h) = | h()aVkH(t) = QM D dv(£). (7) 
R dv(¢) 


The associated backward-forward system of equations in the adjoint processes 
p(t), q(t), r(t), A(t) is defined by 


e BSDE in p(t), q(t), r(t): 


dp(t) = -E oar + q(t)d B(t) «f r(t,c)N(dt, dt); O<t<T 
R 
p(T) = 9(X(T) 4 XO)h'(X (T)). 


(8) 


e SDE in à (t): 


dat) = [o + 5 — 8) xis, no) dt + 2 dBA) 
i dVŁH 
[7 (t,¢)N(dt,de); O<t<T (9) 
R 
AO) = wiry, 


where we have used the abbreviated notation 
H(t) = H(t, X(t), Y(t), AM), Z(t), K(t, -), ut), pŒ), qŒ), r(t), A0). 


Note that, in contrast to the time advanced BSDE (1)-(3), (9) is a (forward) stochastic 
differential equation with delay. 


Theorem 1 (Sufficient maximum principle) Let ü uc æ with corresponding solution 
X(t), Y(t), A(t), Z(t), RG, -), p(t), g(t), F(t), Kit) of (1)-(3), (8) and (9). Assume 
the following: 


MT) >0 (10) 


e For all t, the functions 


x — h(x),x > g(x), x > v (x) and 


(x, y, a, z, ku) > H(t, x, y. a, z, k, u, PC), 4G), FO, AW) 


are concave (11) 


306 B. Øksendal and A. Sulem 


e For all t the following holds, 
(The conditional maximum principle) 
ess sup ELH (t, X(t), Y(t), AW), Z(), KG), v, A, PO, FO, Ft) 1G 
veU 


= EIH (t, X(t), Y (0, AW), Z(t). Kt, ), 80, AD, PA, EO, Ft.) | Gls t € [0, T] 
(12) 


dV; H(t, .) 


< oo for all t € [0, T]. (13) 
dv 


Then ù is an optimal control for the problem (5). 


Proof By replacing the terminal time T by an increasing sequence of stopping times 
t, converging to T as n goes to infinity, and arguing as in [6] we see that we may 
assume that all the local martingales appearing in the calculations below are martin- 
gales. 

Much of the proof is similar to the proof of Theorem 3.1 in [6], but due to the 
predictive mean-field feature of the BSDE (1)-(3), there are also essential differences. 
Therefore, for the convenience of the reader, we sketch the whole proof: 

Choose u € © and consider 


J(u) — J(&) — hh t D 5, (14) 
with 
T 
h := E [/ Ufa - foi] dt, LD:—E[ge(X(D) - e(X(7D)], B := vQqQ)-wv 0)), 
: E . : . (15) 
where f (t) = ft, Y (1t), A(t), u(t)) etc., and Y(t) = Y” (t) is the solution of (1)-(3) 


when u = Q, and A(t) = E[Y(t) | F]. 
By the definition of H we have 


T ^ - 
h-E d {H(t) — H(t) — pb) — qe Q) 
- [ře ora, ¿)v(dg) -2030]; (16) 


where we from now on use the abbreviated notation 


H(t) = H(t, X(t), Y(t), AW), Z(), K (t, -), u(t), AW) 
A(t) = H(t, RA), (0, AM, ZO, Kt), â), A/D) 
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b(t) := b(t) — b(t), 
and similarly with X(t) :— X (t) — X (t), Y (t) := Y (t) - Y (0, AW := AD — A), 
etc. 


By concavity of g, (9) and the Itó formula, 


h < E[g (X(T)) X(T)] 
E[B(T) X (T)] — EIXCD)h (X (7) XCT)] 


T T T 
= (z p Pako + | Xd PA) «f q(t)6 (t)dt 
0 0 


«f L 2089720 ova) — EGR(T)h' (X(T) X(TY] 


= ZI pina f wo (-Bo)a 
0 


T T 
«f ÂNE (dt «f | P. Oy. ovali 
0 0 R 
— E[A(T)h'(X(T))X(T)].- (17) 
By concavity of y and h, (10) and the Itó formula we have 
B < E [wro o] = EROF O] 


T T T 
= ElA(T)Y (T) - E i inate + f Poi | a daly, e| 
0 0 


= E[A(T)(h(X(T)) — h(X(T)))] 


T H " T 3 x T LL 
-el f iar + f rodos | dlY, 221 
0 0 
< 


E(A(T)h' (X (T) X(T)] 


PN T aH aH 
+E I iod + | Y(t) 3 (t) (t — 8) xio,r1 (0) | dt 
0 0 y da 


TOH - T f av,H 
" n oF e Z(dt + n | k 
o Oz 0 JR 


c.c, evdo (18) 
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Adding (16), (17) and (18) we get, by (9), 


Jw- JÓ=h+h+h 
T A 9H . dH. 
«zl [no H(t) 3 X(t) Y (t) 
0 x dy 


oH > 9H ~ A E 
= ras — ô) xE TI}OY (t) — ——-OZ0) —(Ve H(t, -), K (t, -)) | J . 
a Oz 
(19) 


Note that, since Y (s) = Y(s) = L fors € (T, T + ô] by (1), we get 


l T 3H : T- 3 Ĥ —. 
E f —— (t — 8)Y (t) xps, ry (dt =a] ——(s)Y (s + ô)ds 
o ða 0 ða 
n E N | | 
au E | ——(s)Y (s + 6) | Fs | dt 
0 da 


l T-b aff " T3ĤÑ NP 
=E n —(s)E[P(s +8) | Fs] as =E f 2^ (s)A(s)ds |. (20) 
0 da o ða 


Substituted into (19) this gives, by concavity of H, 
J(u) — JÓ 2 h-- DI 


d F dH aH n 
«el [no H(t) 5 (X(t) — X(t)) (Y(t) — Y(t) 
0 x dy 


oH ^ oH A 
- (A - ÂO) - 5 (0 - 20) 
a Oz 
— (VÂ (t, ). (K (t, ) — É (t, )) Jar 
«E n 9H (u(t) — àq)dt 
0 Qu 


T 0H 
= | Ely OAO = aw <0, (21) 
0 u 


since u = u(t) maximizes ELR OIZ]. 
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3.2 A Necessary Maximum Principle 


We proceed to prove a partial converse of Theorem 1, in the sense that we give 
necessary conditions for a control ù to be optimal. In this case we can only conclude 
that u(t) is a critical point for the Hamiltonian, not necessarily a maximum point. 
On the other hand, we do not need any concavity assumptions, but instead we need 
some properties of the set 27 of admissible controls, as described below. 


Theorem 2 (Necessary maximum principle) Suppose à € & with associated solu- 
tions X, Y, Z, K, p.d. f, Á of (1)-(3) and (8) and (9). Suppose that for all processes 
B(t) of the form 

B) :— Xp. TIM, Q2) 


where tg € [0, T) and œ = a(w) is a bounded G,.-measurable random variable, 
there exists ô > 0 such that the process 


a(t) -- rB(t) € & for all r € [—6, ô]. 


We assume that the derivative processes defined by 


x(t) = xf = 4 yinbo) 720; (23) 
dr 
ET P 
y(t) = y^ a) = HYPO) lo, (24) 
r 
d s 
a(t) =a’) = — A" (t) 0, Q5) 
dr 
Hou 
z(t) = zf (0) = — Z*'"P Lo, (26) 
dr 
d 5. 
k(t) = PF) = — K*"P (t |o, (27) 
dr 


exist and belong to L?(m x P), L?(m x P), L?(m x P), and L?(m x P x v), 
respectively. 
Moreover, we assume that x(t) satisfies the equation 
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ob ob ob ob 
dx(t) = {Pox +My + Walt) + — zt) + (Veb, klt, -)) 
ax dy da Oz 
4 Posol dt 
u 
00 oc 00 Jo 
di [Fox + (yn + 2 Walt) + 22) + (Veo. k(t, )) 
x dy da dz 
Sono] dB(t) 
u 
0 0 0 0 
«f Bag ox) + Le, ovn+ Le, paw) + Le oe 
R | ox dy da Oz 


a ~ 
HV (t, E), kC, )) + tt, apo N (dt, d£) ; t € [0, T] 


x(0) =0 
(28) 
and that y(t) satisfies the equation 
dye) =- [Sx + $E» + $t (Da) + {OO 
+(Vig(t), ket, )) + SEO) ] at 
+2(t)d B(t) + fg k(t, )N(dt, do); O<t <T (29) 


y(T) =h'(X(T))x(T) 
yt) =0;7T<t<T+6, 


where we have used the abbreviated notation 


0g 0 
a; E 3,8 X, y, d, Z, K, U)x=X (t),y=Y (t),a=A(t),z=Z(t),k=K (t),u=u (t) CLC. 
Then the following, (i) and (ii), are equivalent: 


d 
(i) FAG + rf),2o = 0 for all B of the form (22) 
r 
NY EP 
(ii) P2. Y (t), AC), Z(t), K (t), u, AM)u=acy l6] = 0, 
where (f, Á, Ż, K, À) is the solution of (1), (3) and (9) corresponding to u — ü. 


Proof Asin Theorem 1, by replacing the terminal time T by an increasing sequence 
of stopping times t, converging to T as n goes to infinity, we obtain as in [6] that 
we may assume that all the local martingales appearing in the calculations below are 
martingales. The proof has many similarities with the proof of Theorem 3.2 in [6], 
but since there are some essential differences due to the predictive mean-field term, 
we sketch the whole proof. For simplicity of notation we drop the hats in the sequel, 
i.e. we write u instead of û etc. 
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d 
(1) => (ii): We can write 2 +rB) |j2o— I1 + D + B, where 
F 


T 
h= £ E | n ft, Y^P (ry, ATP e), zwtrBgy, KF, ue) + ZI 
r 0 
r=0 


d 

h = 4 lex" (7))]-=0 
r 
d 

B = wr"? (0))1,—o. 
r 


By our assumptions on f and y we have 


Tra a a a 
h= l [sx + 95 ayy) + 9f Da) + 9f eect) 
0 x dy da az 
a 
HV f (t, +), k(t, )) + Zoso] ar| (30) 
h = E[g (X (T)x(T)] = E[p(T)x(T)] (31) 
I; = y (Y(0)»(0) = AC) y(0). (32) 


By the It6 formula and (28) 


T T T 
h = Elp(T)x(T)] = E | [ p(t)dx(t) + | x(t)dp(t) + | dlp, x10 | 
p ab ab ab ab 
=E | n pit) [oso + (0y() + Zoa + [OO 
0 x dy ða Oz 
+ (Veb(t), k(t, -)) + Zoso] di | ES (-#Zo) dt 
du 0 ax 
Tn 00 ðo ðo 00 
+ n q(t) {Sx + (rny(t) + — (a(n + (qz) 
0 x dy ða az 
0 
(Vio (1), k(t, )) + s ono] dt 
h a a a a 
+f fre Ü [Ze oxo  * o cy) Ea, paw + “Go 
0 JR x dy ða Oz 
ð 
+ < Vey (t, £), k(t) > +a, ono] 121 


T 8b 8 8 3H 
=E p x(t) [a (t)p(t) + E (t)q(t) «f ced (t, O)r(t, Qv(d£) — 20] dt 
0 x Ox R Ox ox 


T b 9 
«f »o [wo ? (r)q(r) 4 n Ka, Dra, vato] d 
0 dy dy R Oy 


F ab do ay 
d a(t) a, pO + oan f z—G. Ort, C)v(d£) | dt 
0 a da m 0a 
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r b a 

«f «0 [oro 00 | SY 6. orl, tv (dt) at 
0 Oz Oz R Oz 
T 

+ n * (k(t, ), Ved) p) + Vo Da) 

+ [ evt. ore vanwaar] 


T 
-=E U x(t) [-3:0 - uos] dt 
0 X 


X 
T H a 
+ i TUE a- 9 —. (996 wo} a 
0 y dy dy 
T dH a 
+ f a (0 — 9 p- e) d 
0 da da da 
T H a 
+ J IE () — Fw - 4998 e) d 
0 Zz Oz Oz 


T 

" n f k(t, DIVEHG) — Vif) — AG Vig (Djv (de dt 
T 

«f po [Eo 9f a) «sto ar] 
0 ðu ðu ðu 


T a a a 
=~ jj) - l | A(t) EGG + C yq + Ee) 
0 x dy az 


ə 
+(Veg(t), k(t, -)) + oso] Z 


T(8H dH 8H 
+E p [Povo + (z(t) + (VHO), k(t, -)) + aono] ar (33) 
0 y əz ðu 
By the Itô formula and (29), 


T T T 
B A(0)y(0) = E hoso 7 ( n (Ody) + I Yda) + f dià, »0)| 
= EDT] 


a a a a 
-(r p | Oyo E iat) — 2 aze) 
0 y da az 


0 
~(Vig(t), k(t, )) — ato] dt 


T ƏH aH T — 80H 
+f y(0, (dt + y(t) — (t — ô) xis, r (t)dt «f z(t) — (t)dt 
0 y da 0 az 


T 
«f n k(t, VL. ovo) ! (34) 
0 R 


Optimal Control of Predictive Mean-Field Equations ... 313 


Adding (30), (33) and (34) and using that 
T ƏH ie dH 
zif y(t) —( — 8) xps, ridt] = gj y(s + 6) — (s)ds] 
0 da 0 da 


T OH T dH 
= E[ n e WEDE + 517,045] = El i. y(t) (asas), 5) 
0 a 0 da 


we get 


d T 3H 
—J(u +r) |== l + h = E p 21 3 
dr 0 Qu 


We conclude that F 
7H + r£) |-=0= 0 
dr 


if and only if 
T oH 
E n os PME — 0 for all bounded £ € & of the form (12.3.17). 
0 u 
Since this holds for all such £, we obtain that if (1) holds, then 


T | aH 
f E | —(t) | Sn | dt = 0 for all to € [0, T). (36) 
to ðu 


oH 
Differentiating with respect to tọ and using continuity of Pr (t), we conclude that 
u 
(ii) holds. 
(ii) > (i): This is proved by reversing the above argument. We omit the details. 


4 Existence and Uniqueness of Predictive Mean-Field 
Equations 


In this section we study the existence and uniqueness of predictive mean-field BSDEs 
in the unknowns Y (t), Z(t), K (t, £) of the form 


dY(t) = —g(t, Y(t), A(t), Z(t), K (t, ), o)dt + Z(t)a B(t) 
+ fg K(t, OÑ (dt, d£) ; t € [0, T) (37) 
Y(t) -—L;teI[T,T +6]; 6 » 0 fixed, 
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where L € L?(P) isa given .Zr measurable random variable, and the process A(t) 
as before is defined by 


A(t) = E[Y(t 3-6) | F] ; t € [0, T]. (38) 


To this end, we can use the same argument which was used to handle a similar, but 
different, time-advanced BSDE in [7]. For completeness we give the details: 


Theorem 3 Suppose the following holds 


T 
zif g^ (t, 0,0, 0, 0)dt] < oo (39) 
0 


There exists a constant C such that 


lg, yi, 41, zi, k1) — g(t, y2, a2, 22, k2)| € C(]y1 — yl + [zr — zal 
d Iki (¢) — ko(t)2v(de))2) (40) 


forallt € [0, T], a.s. Then there exists a unique solution triple (Y (t), Z(t), K (t, £)) 
of (37) such that the following holds: 


Y is cadlag and E[sup,eyo. 7] Y?(r)] < oo, 
Z, K are predictable and El fp ZO + fig K7(t, )v(dz)}dt] < oc. 


Proof We argue backwards, starting with the interval [T — à, T]: 

Step 1. In this interval we have A(t) = E[L|.¥;] and hence we know from the theory 
of classical BSDEs (see e.g. [8, 9] and the references therein), that there exists a 
unique solution triple (Y(t), Z(t), K (t, £)) such that the following holds: 


Y is cadlag and E[sup;etr. 5 7j Y?(t)] < oo; 
Z, K are predictable and ELT. ZO + fg K?(t, )v(d£))dt] < oo. 


Step 2. Next, we continue with the interval [T — 26, T — ô]. For t in this interval, the 
value of Y (t + ô) is known from the previous step and hence A(t) = E[Y (t+ 6)|.F;] 
is known. Moreover, by Stepl the terminal value for this interval, Y (T — ô), is known 
and in L?(P). Hence we can again refer to the theory of classical BSDEs and get a 
unique solution in this interval. 

Step n. We continue this iteration until we have reached the interval [0, T — nô], 
where n is a natural number such that 


T (nd 180 «T-—n 


Combining the solutions from each of the subintervals, we get a solution for the 
whole interval. 
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5 Applications 


In this section we illustrate the results of the previous sections by looking at two 
examples. 


5.1 Optimal Portfolio in an Insider Influenced Market 


In the seminal papers by Kyle [4] and Back [2] it is proved that in a financial market 
consisting of 


e noise traders (where noise is modeled by Brownian motion), 

e an insider who knows the value L of the price of the risky asset at the terminal 
time t = T and 

e a market maker who at any time f clears the market and sets the market price, 


the corresponding equilibrium price process (resulting from the insider's portfolio 

which maximizes her expected profit), will be a Brownian bridge terminating at the 

value L at time t = T. In view of this we see that a predictive mean-field equation 

can be a natural model of the risky asset price in an insider influenced market. 
Accordingly, suppose we have a market with the following two investment pos- 

sibilities: 

e A risk free asset, with unit price So(t) :— 1 for all t 

e A risky asset with unit price S(t) :— Y(t) at time t, given by the predictive mean- 
field equation 


(41) 
Y@) =L@); te[T,T +ô], 


E = —A(t)u(t)dt + Z(t)d B(t); t € [0, T) 
where u(t) = u(t, œ) is a given bounded adapted process and L is a given bounded 


-r-measurable random variable, being the terminal state of the process Y at time 
T. 


Let u(t) be a portfolio, representing the number of risky assets held at time t. We 

assume that G — FF. If we assume that the portfolio is self-financing, the correspond- 

ing wealth process X (t) = X" (t) is given by 

dX (t) = u(t)dY(t) = u(t)A(t)u(t)dt + u(t)Z(t)d B(t); t € [0, T) 

(42) 
X(0)=x >0. 


Let U : [0, oo) — [—00, oo) be a given utility function, assumed to be increasing, 
concave and C! on (0, oo). We study the following portfolio optimization problem: 
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Problem 1 Find u* € . such that 


a E[U (X" (T))] = E[U(X" (T))]. (43) 
ues 


This is a problem of the type discussed in the previous sections, with f = vy = 
N = 0,@ = U and h(x, w) = L(@), and we can apply the maximum principles 
from Sect. 3 to study it. 

By (6) the Hamiltonian gets the form 


H(t,x,y,a,z, ku, p,q,r, A) = uau(t)p + uzq +au(t)à. (44) 


The associated backward-forward system of equations in the adjoint processes 
p(t), q(t), A(t) becomes 


e BSDE in p(t), q(t): 


dpt) =q@dBt); O<t<T (45) 
p(T) =U"(X(T)), 
e SDE in A(t): 
dX(t) = w(t — &)[u(t — 8)p(t — 6) + AC — 8)]xpo ri (dt 
tu(t)qu)dB(t); O<t<T (46) 
A(0) = 0. 
The Hamiltonian can only have a maximum with respect u if 
AM UO pO + ZMq(t) = 0. (47) 
Substituting this into (45) we get 
dp(t) — —6(Dp(DdB(); 0s: T T 
p(T) —U'(X(T), 
where Auw 
uu H 
O(t) := "EC (49) 


From this we get 


t t 
p(t) = cep | 0(s)d B(s) — >| (0(s))"ds);0<t<T (50) 
0 0 
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where the constant 
c = p(0) = E[U'(X (T)] 


remains to be determined. 
In particular, putting t = T in (50) we get 


T 1 T 
U'(X(T)) = p(T) = cexp(- f &G)dBG) - 5 ri (6(s))2ds) 


or 


T 1 T 
X(T) = (U! (cexpC- n 6G)dBG) — 5 [ (0(s))"ds)). 
Define 
T 1 T 
r(T)- e| 0(s)dB(s) — >| (0(s))^ds). 
0 2 Jo 
Then by the Girsanov theorem the measure Q defined on Fr by 


d Q(o) — F(T)dP (o) 


is an equivalent martingale measure for the market (41). Therefore, by (53), 
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(51) 


($2) 


(53) 


(54) 


(55) 


T 1 T 
x = Eg[X(T)] = E((U) ! (c exp(— I 6G)dBG) — 5 f (A(s))?ds))P(T)I. 


(56) 


This equation determines implicitly the value of the constant c and hence by (53) 
the optimal terminal wealth X(T) = X ' (T). To find the corresponding optimal 


portfolio u* we proceed as follows: 
Define 


Zo(t) := u*(t) Z(t). 
Then (X" (rt), Zo(t)) is found by solving the linear BSDE 


AX" (t) = SOK Z0O qt + Zo( dB; 0<t <T 
X" (T) = E(U^)- (cexp(— fy 0(s)d B(s) — 1 fy ((s))°ds) PZ). 


We have proved: 


(57) 


(58) 
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Theorem 4 (Optimal portfolio in an insider influenced market) The optimal port- 
folio u* for the problem (43) is given by 


Zo(t) 
Z(t)’ 


u(t) = (59) 


where Zo(t), Z(t) are the solutions of the BSDEs (41), (58), respectively, and c and 
0 are given by (56) and (49), respectively. 


5.2 Predictive Recursive Utility Maximization 


Consider a cash flow X (t) = X“ (t) given by 


dX(t) = X(O[u(0dt + o (dBA) 
+ fg y(t, ¢)N(dt, dz) — e(0X(04dt; t € [0, T) (60) 
X(0) 2 x » 0. 


Here u(t), o (t), y (t, £) are given bounded adapted processes, while u(t) :— c(t) is 
our control, interpreted as our relative consumption rate from the cash flow. We say 
that c is admissible if c is F-adapted, c(t) > 0 and X€ (t) > 0 for all t € [0, T). We 
put G =F. 

Let Y(t) = Y*(t), Z(t) = Z(t), K(t, &) = K*(t, £) be the solution of the 
predictive mean-field BSDE defined by 


dY (t) = —{a(t)A(t) + In(c(t)X (1) }dt + Z(t)d B(t) 


+ fp KG, N (dt, dé); t € [0, T) (61) 
Y(T) — 0, 


where a(t) > 0 is a given bounded F-adapted process. Then, inspired by classical 
definition of recursive utility in [3], we define Y^(0) to be the predictive recursive 
utility of the relative consumption rate c. 

We now study the following predictive recursive utility maximization problem: 


Problem 2 Find c* € such that 


sup Y*(0) = Y^" (0). (62) 
ceof 


We apply the maximum principle to study this problem. In this case we have 
f =¢=h=0, W(x) = x, and the Hamiltonian becomes 


H(t, x, y,a, z, k, u, p qur, X) = x[(4(0) — c)p + o (a «f ve. C)rG)v(dt, dc)] 


+ [ao (t) + Inc 4- In x]. (63) 
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The associated backward-forward system of equations in the adjoint processes 
p(t), q(t), A(t) becomes 


e BSDE in p(t), q(t): 


dp) = -EHA — ce) p) +o Mat) + fa yE, Ovdt, d£) + Mt 
+q(t)dB(t) + fer(t,¢)N(dt,do); O<t<T 
p(T) =0, 
(64) 
e SDE in A(t): 


X10) es: taa 


[et = a(t -8)( — 8Ixp.ri(Odr ; OS t T 
The delay SDE (65) does not contain any unknown parameters, and it is easily seen 
that it has a unique continuous solution A(t) > 1, which we may consider known. 
We can now proceed along the same lines as in Sect. 5.2 of [1]: Maximizing H 
with respect to c gives the first order condition 


pac (66) 
C DÁM 
X(t) p(t) 
The solution of the linear BSDE (64) is given by 
= a TAOTO, or. 
ropt) = El » es dsl. (67) 


where 


dT (r) 2 TEUA — c())dt + o (dB) fg y. HÑ (dt, dt); O<t<T 
r(0)=1. 
(68) 
Comparing with (60) we see that 


X(t) 2 xT(t); O<t <T. (69) 


Substituting this into (67) we obtain 


T 
POX) = zif A()ds.Z,]; 0<t<T. (70) 
t 


320 B. Øksendal and A. Sulem 


Substituting this into (66) we get the following conclusion: 


Theorem 5 The optimal relative consumption rate c* (t) for the predictive recursive 
utility consumption problem (62) is given by 


A(t) 


c* (f) = —À——À———— 
Elf’ X()ds|Z;] 


;0<t<T, (71) 


where X(t) is the solution of the delay SDE (65). 
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Modelling the Impact of Wind Power 
Production on Electricity Prices 

by Regime-Switching Lévy Semistationary 
Processes 


Almut E.D. Veraart 


Abstract This paper studies the impact of wind power production on electricity 
prices in the European energy market. We propose a new modelling framework 
based on so-called regime-switching Lévy semistationary processes to account for 
forward-looking information consisting of predicted wind power generation. We 
show that our new regime-switching model, where the regime switch depends on the 
so-called wind penetration index, can describe recent electricity price data well. 


1 Introduction 


Renewable sources of energy are of increasing importance in modern energy mar- 
kets. For instance, the European Union has set the target of increasing the share of 
energy from renewable sources by 2020 to 20 %. In the German and Austrian energy 
market, which will be the focus of this paper, the most important source of renew- 
able energy is wind, followed by biogas and solar. Since many renewable sources are 
highly dependent on weather conditions, they tend to increase the volatility of the 
corresponding energy prices. It is hence urgent and important to find reliable models 
which can describe electricity prices in these changing market conditions which can 
be used for risk assessment and management in energy markets. 

The recent literature has presented a variety of both discrete-time and continuous- 
time time series models which promise to describe the stylised facts of energy 
markets, see e.g. [3, 17] for reviews. However, reliable models which incorporate 
information on renewable sources have only recently emerged and have currently 
been restricted to discrete-time models, see e.g. [9—12, 18]. 

This paper contributes to the continuous-time literature by introducing for the first 
time a modelling framework which takes the forward-looking information avail- 
able to market participants through wind production forecasts into account when 
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modelling electricity day-ahead prices. In doing so, it extends recent work by [1] 
who proposed to model electricity spot prices by so-called Lévy semistationary 
processes. Their model consists of a reduced form approach, which only considers 
electricity prices directly and does not take any price information from other fuels 
or commodities into account. Note that other types of forward-looking information, 
such as capacity constraints, have been incorporated in models for electricity prices 
by [8]. Also, [2] have developed a framework for incorporating forward-looking 
information in electricity or weather markets through an enlargement of filtrations 
approach. 

With the increasing power generation through wind farms, we have observed 
that electricity prices at the European Energy Exchange (EEX) started to become 
negative, which happened for the first time in October 2008, and partially even 
exhibited rather extreme negative price spikes, see e.g. [16] and the references therein. 
Also, various articles have argued that increasing wind power production seems to 
decrease the overall price level, but tends to increase the observed volatility in the 
market, see e.g. [12]. These are important findings, which need to be incorporated 
into a modelling framework, one of which will be presented in this paper. 

The outline for the remaining part of this article is as follows. In Sect.2 we give 
a detailed description of the data from the EEX which will be used in our empirical 
analysis and we carry out an exploratory data analysis to motivate the new model we 
are going to introduce in this paper. Section3 contains the main contribution, where 
we introduce the new class of regime-switching Lévy semistationary processes and 
show how they can be calibrated to our empirical data. Finally, Sect. 4 concludes. 


2 Exploratory Data Analysis 


This section presents the results of an exploratory data analysis of electricity price 
and wind production data from the European Energy Exchange, which motivates the 
new modelling framework which we will introduce in Sect. 3. 


2.1 Description of the Data 


Our empirical data analysis focuses on electricity prices and wind data for a time 
period from 01.01.2011 to 31.07.2014, i.e. consisting of 1308 days. 

We consider three sets of data: electricity prices, their corresponding volumes 
(sometimes called loads) and wind production data. More precisely, daily EEX 
Phelix baseload and EEX Phelix baseload volume data (for Germany and Austria) 
were downloaded from Datastream and the EPEX spot website. In addition, we 
downloaded the forecasted wind production data for the four German Transmission 
System Operator (TSOs) (50Hz Transmission, Amprion, Tennet TSO, EnBW 
Transportnetze (Transnet) ) and one Austrian TSO (Verbund (APG)). These data 
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Table 1. Summary statistics of the EEX Phelix baseload from 01.01.2011 to 31.07.2014 


Minimum 1st Quartile Median Mean 3rd Quartile | Maximum 


—56.87 33.87 43.10 41.96 50.69 98.98 


have been aggregated to obtain daily forecasts for the wind production for each of 
the five TSOs. 


2.2 EEX Phelix Baseload Prices 


First of all, we want to explore the specific properties of the time series of the 
electricity price data. We focus on the day-ahead electricity prices determined by a 
daily auction at 12:00 pm, 7 days a week all year (including statutory holidays). The 
underlying quantity to be traded is the electricity for delivery the following day in 
24h intervals. The prices are bounded (currently between [—500, 3000] EUR/MWh). 
The EEX Phelix baseload is obtained as the daily averages of the 24h day-ahead 
prices for Germany and Austria. 

From the summary statistics in Table 1, we notice that the times series does not 
feature any truly extremes spikes, which have occurred in older data sets from the 
EEX market. In addition, we notice that there are negative prices even in the baseload 
prices which consists of the average of the 24h prices, see also Fig. 1. The plot of 
the autocorrelation function reveals a clear weekly pattern which is one of the well- 
known stylised facts of such data. 

When we study the distributional properties of the price data, see Fig. 2, we clearly 
observe that the empirical distribution is not well described by a Gaussian distribu- 
tion, but appears to be asymmetric and features heavier tails, the latter is particularly 
pronounced when we focus on the left tail of the distribution. 
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Fig. 1 Time series plot of the baseload prices (in EUR/MWh) and autocorrelation plot 
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Fig. 2 Distributional properties of the prices: The standardised histogram of the empirical distrib- 
ution and estimated kernel density function of the prices are depicted in the first plot. The second 
plot compares the empirical distribution to a Gaussian distribution via a quantile-quantile plot 


2.3 Predicted Wind Energy Feed-In 


Next we investigate the data from the five Transmission System Operator (TSOs) in 
Germany and Austria. Note that we are studying the one-day ahead predicted wind 
feed-in since we assume that this is the quantity which impacts the one-day ahead 
electricity prices determined in the daily auction. 

Note that for each TSO, the data is available in 15 min intervals, where the unit of 
measurement is Megawatt (MW). In order to get the hourly forecasts, we aggregated 
the data as follows. Let V. j) denote the 15 min wind power forecast for quarter 
i within hour t. We then obtain the hourly forecasts (recorded in Megawatt hours 


(MWh)), denoted by x where t(j) denotes the jth hour on the tth day, from 


(h) (q) 
Vij MW - h] = 5. gH Voo MW]. 
i=l 


Further, we obtain the daily forecasts, denoted by v from 


v? [MW - h] = Sv [MW - h]. 


j=l 


The data are summarised by TSO in Table 2. 

In the following study, we focus on the aggregated time series which accumulates 
the predicted wind feed-in of all five TSOs, see Table3 for the summary statistics 
and Fig. 3 for the time series plot. We clearly observe a yearly cycle in the wind data 
showing that the predicted wind feed-in is always highest during the winter time 
period. 
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Table 2 Forecasted wind feed-in in Gigawatt hours (GWh) (rounded) 


50 Hz Amprion | APG Tennet Transnet | Total 
2011 19490 6687 18 18484 416 45095 
2012 20203 7253 25 20464 246 48191 
2013 19129 7742 31 21259 425 48585 
2014 (Until July) 11648 4799 21 13412 399 30279 


Table 3 Summary statistics of the forecasted aggregated wind production data (in GWh) from 
01.01.2011 to 31.07.2014 


Minimum 1st Quartile Median Mean 3rd Quartile | Maximum 
12510 53870 98940 131600 176000 572600 
Sa T T T T 
Jan 2011 Jan 2012 Jan 2013 Jan 2014 


Fig. 3 Time series of the forecasted aggregated wind production data from 01.01.2011 to 
31.07.2014 reported in GWh 


2.4 Wind Penetration Index 


Jónsson et al. [10] pointed out that there is a non-linear and time dependent relation- 
ship between wind power forecasts and spot prices. Moreover, they found that “it is 
in fact the ratio between the forecasted wind power generation and the forecasted 
load that has the strongest association with the spot prices", see [10, p. 314]. Hence, 
in the following analysis, we will not use the predicted wind feed-in data directly, but 
rather focus on the so-called wind penetration index, which describes the percentage 
of the wind feed-in compared to the total energy production. 
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Table 4 Summary statistics of the wind penetration index from 01.01.2011 to 31.07.2014 


Minimum 1st Quartile Median Mean 3rd Quartile | Maximum 


0.02104 0.08796 0.15480 0.19050 0.26450 0.67540 


In order to compute this index, we follow the approach outlined in [10]: They 
argued that according to recent work by [15], state-of-the-art forecasting models 
lead to load forecasts where the predicted load equals the actual load plus an error 
term, i.e. let AL; denote the actual load on day ¢ and let L; denote the corresponding 
predicted load. Then 


AL; =L;+€, where e  N(0, 07). 


Typically, the standard deviation c is chosen as 2 % of the average load for the period 
considered. 

Following this methodology, we downloaded the actual load data from Datastream 
and the EPEX website and computed the predicted load data by adding Gaussian 
perturbations to the actual load data. Clearly, this is not exactly the same as working 
with the predicted load information from each TSO directly, see [10] for a discussion, 
but the practical impact of this approximation has been found to be marginal. 

We can now define the so-called wind penetration index on day t as 


ve 
WP, = . 
L, 


This is in fact the prediction of the wind penetration on day t, which is available on 
day t — 1 and can hence be considered as forward-looking information. We provide 
the summary information of the wind penetration index for our sample in Table 4. 

Moreover, a time series plot and the corresponding histogram of the wind pene- 
tration index is depicted in Fig. 4. We observe that the time series plot of the wind 
penetration index resembles the one for the original wind data—including a yearly 
seasonal pattern. Also, the wind penetration index is overall rather low, which is 
indicated by the histogram and the quantile information contained in Table 4. This 
is not surprising since the conventional fuels still account for the majority of the 
electricity production in the European energy market. 


2.5 The Relation Between Prices and Wind Data 


Finally, we carry out an exploratory study of the relation between the electricity 
prices and the wind penetration index. In Fig. 5, we plot the electricity prices versus 
the wind penetration index to check whether we can spot any association between the 
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Fig. 4 Time series plot and histogram of the wind penetration index 
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Fig. 5 The electricity prices are plotted versus the wind penetration index. The five horizontal 
lines correspond to the minimum, the 25 %, 50%, 75 % quantiles and the maximum of the wind 


penetration index, respectively 


two variables. We observe that the two lowest electricity prices are associated with 
a rather high wind penetration index. Also, for a very high wind penetration index, 
the prices seem to be below their mean value. This is in line with earlier studies 
which found that a high wind production typically results in lower electricity prices. 
However, we need to keep in mind that by comparing the wind and the electricity 
prices, we can only obtain a partial picture, since clearly other fuels, such as coal, 
gas and nuclear, play a key role in determining the corresponding electricity price 


and are for the purpose of this study excluded from the analysis. 
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Fig. 6 Distribution of the electricity prices for different quartiles of the wind penetration index. 
E.g. the first plot corresponds to a low wind penetration index (the first quartile) and the last one to 
a rather high wind penetration index (the fourth quartile) 


We also compare the distribution of the electricity prices associated with different 
quartiles of the wind penetration index. That is, we have divided our price data into 
four groups corresponding to the 1st, 2nd, 3rd and 4th quartile of the wind penetration 
index. When comparing the corresponding marginal distributions, we observe again 
that smaller price data are associated with a higher wind penetration index, see Fig. 6. 

The finding from this exploratory study motivates the new modelling framework 
which we are going to introduce in the next section. 


3 Model Building 


Recent work by [1] suggests that the class of so-called Lévy semistationary (LSS) 
processes is very suitable for modelling electricity day-ahead prices. In their work, the 
class of LSS processes was used in a truly reduced form modelling set-up, meaning 
that the (deseasonalised) electricity prices were modelled directly by LSS processes 
and no other external variables were included in the analysis. 

Here we will go one step further and explore the possibility of including forward- 
looking information in form of the wind penetration index into a new modelling 
framework which is based on LSS processes. 

In a first step, we are going to review the basic traits of LSS processes and then we 
will discuss how an LSS-based model can be extended to account for forward-looking 
information. 
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An LSS process Y = {Y (t));cg on R without drift is defined as 


t 
Y(t) = g(t — s)o(s—)dL(s), (1) 


—O0o0 


where L denotes a two-sided Lévy process, g : R — R denotes a determinis- 
tic weight function satisfying g(s) = 0 whenever s « 0 and o denotes a càdlàg, 
adapted stochastic volatility process, which is assumed to be independent of L. In 
order to ensure the existence of the stochastic integral, we need suitable integrability 
conditions on the kernel function g, see [1] for details. 

Note that the name Lévy semistationary process indicates that the process Y is 
stationary as soon as the stochastic volatility process c is a stationary process. The 
reason for choosing a stationary process for modelling deseasonalised electricity 
prices is that commodity prices typically exhibit strong mean reversion. A stationary 
process can in fact mimic such a behaviour since it ensures that the process cannot 
move away from its long term mean indefinitely, but will need to return to it since 
otherwise the stationarity would not be preserved. 

Many well-known stochastic processes belong to the LSS class, including volatil- 
ity modulated Ornstein-Uhlenbeck processes, continuous-time autoregressive mov- 
ing average (CARMA) processes and fractionally integrated CARMA processes. 

It is important to note that LSS processes are in general not semimartingales, 
which has been discussed in detail in [1]. However, in this paper, we will in fact be 
staying within the traditional semimartingale framework since we will be working 
with volatility modulated CARMA processes as the main building blocks for our 
new model. To this end, let us briefly recall their definition. 

Suppose we have nonnegative integers p > q and we wish to define a CARMA 
(p, q) process. We introduce the autoregressive (AR) and moving average (MA) 
polynomials: 


PAR) (z) = zP p ayz?7! +--+ + ap, 


PMAD (7) = bo + biz +++ + bp-iz" |, 


where b, = 1 and b; = Oforg < j < p. Moreover, we assume that the polynomials 
have no common roots and then write formally 


PROMY (r) = pM^G (py pr (t), 


where D — r3 We can make sense of the "derivative" of the Lévy process through 
a state space representation, where we write 


Y(t) =b' V(t), for dV(t) = AV(t)dt + edL(t), where 
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o bo 
0 0 1 bi 
A= . e= b= 
"n. 0 
0 0 1 1 by 
—dp —üp-| «++ ++ —d] 


Assuming that all eigenvalues of A have negative real parts, we know that 


t 
V(t) = J eM edL(s) 


is the (strictly) stationary solution of the stochastic differential equation above, see 
[5]. That is, in our LSS specification we can choose g(x) = b'e^*eandc = 1 to 
obtain a CARMA(p, q) process. As soon as a stochastic volatility process is added, 
we would call the corresponding LSS process a volatility modulated CARMA process. 


3.1 Deseasonalising the Data 


We argued before that stationary processes can easily accommodate key stylised 
facts of commodity prices. However, at the same time, we cannot ignore that strong 
seasonal effects are typically present in such markets and need to be accounted for. 
We proceed by introducing an arithmetic model for the electricity day-ahead price, 
denoted by S = (S(t));>0, where 


S(t) = A(t) + Y (t). 


Here, A denotes a deterministic seasonality and trend function and Y denotes a sto- 
chastic process. In the original framework proposed by [1], the process Y was chosen 
to be an LSS process. In the following, however, we will introduce a modification of 
that modelling framework. 

The seasonality and trend function is chosen to be 


+ 2nt + mt 
A(t) = co + cit + c2 cos Cx) + c3 cos (==) 


which accounts for a linear trend and weekly and yearly seasonal cycles. We used a 
robust least squares estimation procedure to estimate the parameters (by truncating 
the spikes in the estimation procedure) and obtained the following estimates and 
standard errors, see Table 5, all of which were highly significant. The estimated 
parameters in the seasonality function confirm the existence of both weekly and 
yearly seasonality as well as the presence of a negative trend. 
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Table 5 Estimated parameters in the seasonality and trend function A 


co al c2 c3 TI T2 
Estimate 53.47 -0.001724 2.377 —6.815 349.25 33.59 
Standard error | 0.4693 0.0006203 | 0.3356 0.3302 50.46 0.3394 


TT | 
MI AY | ii I Ml | y j | 


^C PT I ROPA PONAM d 


Prices 


T T T T T T T 
0 200 400 600 800 1000 1200 


Days 


Fig.7 Electricity day-ahead data and fitted seasonality and trend function 


Figure 7 shows a graph of the original data with the fitted seasonality function 
super-imposed. Also, Fig. 8 depicts the deseasonalised price data and their autocor- 
relation function. 

It should be noted that a variety of alternative procedures could be followed to deal 
with the problem of seasonality in the electricity prices. Here we are dealing with a 
rather simple deterministic parametric function to mimic the trend and the seasonal 
cycles. Using e.g. weekly and yearly dummy variables could refine this approach even 
further, but would result in a less parsimonious model. A more interesting alternative 
to the approach pursued here is to acknowledge the fact that the seasonality cannot 
only be determined by historical data, but also through other market data available to 
market participants. Example, it has been observed that gas and coal prices, given that 
they are important fuels used to produce electricity, play a key role in determining 
trend and also seasonal cycles of electricity prices. This suggests that e.g. forward 
curve data for gas and/or coal could be used to model the trend, see e.g. [8] for 
research along those lines. Also [14] give a detailed account on robust estimation 
procedures of the long-term seasonal component of electricity prices. 
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Fig. 8 Time series and autocorrelation plot of the deseasonalised electricity day-ahead data 


Table 6 Estimated parameters of the associated ARMA(2,1) process 


Estimate 


Standard error 0.0597 0.0447 0.0501 


Here ARI and AR2 corresponds to the first and second autoregressive parameter, respectively, and 
MA corresponds to the moving average parameter 


3.2 Fitting a CARMA Process 


After the seasonality has been removed, we need to find a suitable model for the 
stochastic process Y. Following the success of the CARMA processes in describing 
electricity prices, we choose a Lévy semistationary process where the kernel function 
is given by a kernel associated with a CARMA(p,q) process. More specifically we 
choose p = 2 and q = 1. Note that when choosing the order of the CARMA(p,q) 
process, we need to consider pairs (p,q) such that p > q so that the CARMA 
process is well defined. We choose a CARMA(2,1) process due to reasons of analyt- 
ical tractability and increased flexibility compared to a simple Ornstein-Uhlenbeck 
model. In our goodness-of-fit study, we indeed find that such a model choice is 
suitable here. 

Note that [6] have discussed in detail how a discretely sampled CARMA process 
can be represented as a weak ARMA process. Using this representation we have first 
estimated the corresponding ARMA(2, 1) parameters by a quasi-maximum likelihood 
method. The corresponding parameters estimates and standard errors are provided 
in Table 6. 
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Fig. 9 Empirical (bars) and estimated (solid line) autocorrelation function of the estimated 
CARMA(?2,1) process 


Following the procedure outlined in [6] we can then recover the corresponding 
continuous-time parameters. In our case, we have a, = 1.459, az = 0.162, bo = 
0.383. Note that one can easily verify that the estimates satisfy the condition that 
the eigenvalues of A have negative real parts, which implies a stationary model. We 
compare the empirical and estimated autocorrelation function in Fig.9, where we 
observe a good fit. 

Under the assumption that the CARMA process is driven by a subordinator, [6] 
have shown how the corresponding increments of the driving Lévy process can be 
recovered from a discretely observed CARMA process, see also [7] for the multivari- 
ate case. Here we have implemented their algorithm for the case of a CARMA(2,1) 
process and have recovered the driving process. Note that the original algorithm 
was designed for driving Lévy processes, but can in fact be adapted to the case of 
volatility modulated Lévy processes as well, the case which will be relevant in the 
next section. 

Let us briefly recap our estimation procedure until now: We started off with a spot 
price model S(t) = A(t) + Y(t), where we have estimated the seasonality function 
A and have removed it from the data. In the next step, we have assumed that Y is an 
LSS process of the form 


t 
Y(t) a g(t — s)aM (s), 
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where g corresponds to the kernel function associated with a CARMA (2,1) process 
and initially M was assumed to be a Lévy process, which can be recovered from the 
observations of the CARMA process. We will now leave this traditional framework 
behind and will introduce a new regime-switching model based on LSS processes. 


3.3 The New Model Based on a Regime-Switching LSS 
Process 


The predicted wind penetration index can be viewed as forward-looking information 
since the information is available before the prices for the next day are determined 
in the auction market. Hence it is reasonable to try to incorporate this information in 
the model. 

Previous studies have included such information in discrete-time models such as 
e.g. ARMAX-GARCHX models, see [12], where the wind is treated as an exoge- 
nous variable. However, we are interested in a continuous-time modelling framework. 
E.g. one could consider CARMA-X models or regime-switching models. Here we 
will follow the latter approach which is motivated by the work by [8] who incorpo- 
rated forward-looking capacity constraints into a jump-diffusion model for electricity 
prices. 

We introduce an exogenous regime-switching variable based on the forward- 
looking variable given by the predicted wind penetration index p, where 


l, ifthe predicted wind penetration at time s is "high" 
p(s) = (2) 


0, if the predicted wind penetration at time s is "low". 


The new spot price model is then given by S(t) = A(t) + Y(t), where 


t 
ro- | git —s)dM(s), t>0. 


Here 
dM(s) = p(s)\dM(s) + (1 — pG)d MC s), 
where 
dM (s) =a (s)ds + o9 (s-)aLO (s), 
for independent Lévy processes LU and L®. Also, a denote suitable drift and 
o stochastic volatility processes, for i € (1, 2}. 


The key question which remains to be addressed is how exactly the regime- 
switching variable p should be chosen, given that the expression in (2) appears 
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Fig. 10 Standardised histograms of the increments of M for different levels (associated with the 
four quartiles) of the wind penetration index 


rather casual. In order to answer this question, we study the empirical properties of 
the recovered driving process M. More precisely, we investigate how the marginal 
distribution of the driving process of the CARMA process changes in relation to 
different levels of the wind penetration index. We split the sample of the recovered 
increments of M into four parts corresponding to the four quartiles of the wind 
penetration index, which are given in Table 4. 

Their empirical distributions are described in form of standardised histograms, 
which describe the empirical probability density functions, in Fig. 10. Similarly to 
the finding in our exploratory data analysis, we observe that also the distribution 
of the increments of the driving process M changes quite remarkably for different 
levels of the wind penetration index. In particular, we observe that rather extreme 
negative increments are associated with a relatively high wind penetration index. 

One can imagine a variety of rather sophisticated methods for choosing the cut-off 
point for our regime-switching variable. Here we are interested in a rather simple 
rule, which at the same time allows for a reasonable amount of observations in the 
high regime so that inference is still feasible and does not just rely on a very small 
number of observations. Hence, we choose the cut-off point to be 26.4 % as a hard 
threshold, meaning that all increments of M associated with a wind penetration index 
in the fourth quartile belong to the high regime. 
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3.4 Model for M Based on the Generalised Hyperbolic 
Distribution 


Motivated by the empirical study in [1], we will fitthe class of generalised hyperbolic 
(GH) distributions to the increments of M in the two regimes. 

Our notation for the GH distribution follows the one used in [13]. See also [4] for 
more details on the implementation of the corresponding estimation procedures in R 
available through the ghyp package. 

Let us denote by d, k € N some constants and let X denote a k-dimensional 
random vector. Recall that we say that the law of X is given by the multivariate 
generalised hyperbolic (GH) distribution if 


law 


X 2 p+ Wwy+JWCZ, 


where Z ^ N(0, Ix), C € Rdxk LYE R4. Here W > 0 denotes a one-dimensional 
random variable, independent of Z and with Generalised Inverse Gaussian (GIG) 
distribution, i.e. W ~ GIG(A, x, v). The density of the GIG distribution with 
parameters (A, x, Y) has the following functional form: 


_ (de)? o 1 (x 
Jesu (3) 2KA GA) e«( iG +1). 


where K denotes the modified Bessel function of the third kind, and the parameters 
have to satisfy one of the following three restrictions 


x >0,0>0,A <0, or x»0,5-0,A-0, or x 20,0 >0,A>0. 


The parameter p is called the location parameter, X = CC’ is the dispersion matrix 
and y is the symmetry or skewness parameter. The three (scalar) parameters À, x, Y of 
the GIG distribution determine the shape of the GH distribution. The parametrisation 
described above is referred to as the so-called (A, x, Y, y, X, )-parametrisation of 
the GH distribution. Since this parametrisation causes an identifiability problem when 
one tries to estimate the parameters, we will work with the so-called (A, o, y, E, y)- 
parametrisation in our empirical study. One can show that the (A, x, Y, u, E, y)- 
parametrisation can be obtained from the (A, o, jj, X, y)—parametrisation by setting 


—K)41(@) o? _ Ky@) 
y=a—_., X= oL uS 
KX(a) v Ky (0) 
and A, X, y remain the same, cf. [4]. 
We estimated 11 distributions within the GH class—consisting of the asymmetric 
and symmetric versions of the GH, hyperbolic, Student's t, variance gamma, nor- 
mal inverse Gaussian distribution and the Gaussian distribution and compared them 
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Table 7 Parameter estimates for the Student's t-distribution in the low regime (symmetric case) 
and high regime (asymmetric case) 


Low regime 6.70 2.26 9.77 0 
High regime 4.76 0.57 11.31 -7.68 


Sample Quantiles 
Sample Quantiles 


n 


Theoretical Quantiles Theoretical Quantiles 


Fig.11 Diagnostic quantile-quantile plots: The first picture compares the empirical quantiles of the 
data in the /ow regime with the estimated symmetric Student's t-quantiles, and the second picture 
compares the empirical quantiles of the data in the high regime with the estimated asymmetric 
Student's t-quantiles 


according to the Akaike information criterion. We found that the best model for 
the low regime is given by the symmetric Student's t-distribution and for the high 
regime by the asymmetric Student's t-distribution, see Table 7 for the corresponding 
parameter estimates. 

When comparing the parameter estimates for the low and the high regime provided 
in Table 7, we observe that the skewness, fatness of the tails and the volatility increases 
for the high regime and that the mean parameter decreases compared with the low 
regime. This is in line with previous findings in the literature, which suggest that 
the price level typically decreases with increasing wind energy production and that 
the volatility and the risk for negative spikes (represented through negative skewness 
and fatter tails) typically increases. 

Also, we have provided quantile-quantile plots to assess the goodness-of-fit of the 
Student's t-distribution in Fig. 11, which overall look reasonable. 

The estimation results suggest that a good model for the driving process M in the 
regime-switching LSS specification is given by 
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dM (s) = p(s)d MU (s) + (1 — p(s))adMO (s), 


where 


dM (s) = (uo +y (4^9) ds +a (s)aW (s), 


dMO (s) = pds +o (s)aW (s), 


for independent Brownian motions W‘) and W®). Here the stochastic volatility 
processes o are chosen as Ornstein-Uhlenbeck processes with inverse Gamma 
marginal distribution, since a mean-variance mixture with the inverse Gamma dis- 
tribution results in the Student's t-distribution. 

Note that the reason for choosing volatility modulated Brownian motions rather 
than Lévy processes with Student's t-distribution is that we found a significant short 
term (2 lags) autocorrelation in the increments of the recovered process M suggesting 
that a stochastic volatility model is more suitable than a pure jump model. This finding 
reveals that stochastic volatility is a key feature in energy markets, but it typically 
only exhibits short memory. Stochastic volatility is naturally incorporated into the 
LSS framework making it a convincing modelling tool for energy markets. 


4 Conclusion 


This paper has presented an extension of the modelling framework based on Lévy 
semistationary (LSS) processes introduced by [1]. Since forward-looking informa- 
tion in terms of weather forecasts is available to market participants, the corre- 
sponding predictions for the day-ahead wind production can be derived and used 
when determining day-ahead electricity spot prices. We incorporated this informa- 
tion through the so-called predicted wind penetration index in a regime-switching 
model based on LSS processes. We have observed that the flexibility offered through 
the regime switching component allows to model electricity prices in a more refined 
way than it was possible in the original (reduced form) LSS modelling framework. 
In particular, we have found that a relatively high wind penetration index leads to a 
lower mean level, higher skewness, fatter tails and increased volatility in the distribu- 
tion of the electricity prices. This confirms earlier findings in the literature and for the 
first time links them to a flexible continuous-time stochastic modelling framework. 

Given the increasing importance of renewable sources of energy, it will be inter- 
esting to extend the current investigation to include a wider range of renewables, 
including solar and biogas which up to now do not play as big of a role as wind 
power generation in the European energy market. 

Another area for future research would be to develop a stochastic model for the 
wind penetration index, which could either result in a regime-switching model with 
a stochastic switching parameter or in a joint model for electricity prices and the 
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wind generation index. A preliminary analysis along those lines has revealed that 
a reasonable model for p needs to take both yearly seasonality and clusters into 
account. This could be seen as a first step for constructing models which can be used 
for mid-term forecasts of electricity prices influenced by renewables and would help 
to find modelling and inference tools for reliable risk management in energy markets. 
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Pricing Options on EU ETS Certificates 
with a Time-Varying Market Price of Risk 
Model 


Ya Wen and Rüdiger Kiesel 


Abstract To price options on emission certificates reduced-form models have 
proved to be useful. We empirically analyse the performance of the model proposed 
in Carmona and Hinz [2] and Hinz [8]. As we find evidence for a time-varying mar- 
ket price of risk, we extend the Carmona-Hinz framework by introducing a bivariate 
pricing model. We show that the extended model is able to extract information on 
the market price of risk and evaluate its impact on the EUA options. 


Keywords Carbon market - EUA futures - Risk-neutral valuation * Market price of 
risk + Option pricing 


1 Introduction 


The European Union Emissions Trading Scheme (EU ETS) was launched in 2005 
and constitutes still the world's largest carbon market. The EU ETS was set up 
as a cap-and-trade scheme and split up into three phases, namely Phase I (2005- 
07) without the possibility to bank unused permits; Phase II (2008-12) in which 
banking was allowed; and the current Phase III (2013-20) which, compared to the 
two previous periods, introduced significant changes, such as the abolishment of 
national allocation plans and the auctioning of permits. 

Besides permits, futures and options on permits are being traded. Various authors 
have discussed the design of the market and the pricing of the permits and the 
derivatives traded. The fundamental concepts for emission trading and the market 
mechanism have been reviewed in the paper of Taschini [15], which also provides 
a literature overview. Equilibrium models for allowance permit markets have been 
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widely used to capture the theoretical properties of emission trading schemes. Exam- 
ples are the dynamic but deterministic model proposed by Rubin [13] and stochastic 
equilibrium approaches such as Seifert et al. [14], Wagner [16] and Carmona et al. 
[3]. These models use optimal stochastic control to investigate the dynamic emission 
trading in the risk-neutral framework. Carmona et al. [4] derive the permit price for- 
mula which can be described as the discounted penalty multiplied by the probability 
of the excess demand event. Its historical model fit has been evaluated by Grüll and 
Taschini [6]. Grüll and Kiesel [5] used the formula to analyse the emission permit 
price drop during the first compliance period. Carmona and Hinz [2] and Hinz [8] 
propose a reduced-form model which is particular feasible for the calibration of EUA 
futures and options as it directly models the underlying price process. Both Paolella 
and Taschini [12] and Benz and Trück [1] provide an econometric analysis for the 
short-term spot price behavior and the heteroscedastic dynamics of the price returns. 
For the option pricing, Carmona and Hinz [2] derive a option pricing formula from 
their reduced-form model for a single trading period. They also discuss the extension 
ofthe formula to two trading periods. Hitzemann and Uhrig-Homburg [9, 10] develop 
an option pricing model for multi-compliance periods by considering a remaining 
value component in the pricing formula capturing the expected value after a finite 
number of trading periods. 

As emission certificates are traded assets their price paths carry information on 
the market participant expectations on the development of the fundamental price 
drivers of the certificates including the regulatory framework. In particular, prices 
of futures and options of certificates carry forward-looking information which can 
be extracted by using appropriate valuation models. In this paper we derive such 
a model by extending the reduced-form pricing model of Carmona and Hinz [2]. 
Using an extensive data set we extract a time series for the implied market price of 
risk, which relates to the risk premium the investors attach to the certificates. This 
requires a calibration of the model to historical price data during varying time periods 
and with different maturities of futures and options. A crucial step in the calibration 
procedure is a price transformation of normalized futures prices of permits from a 
pricing measure to the historical measure. We find that the implied market price of risk 
possesses stochastic characteristics. Therefore, we extend the existing reduced-form 
model by modeling the dynamics of the market price of risk as an Ornstein-Uhlenbeck 
process and show that the extended model captures the appropriate properties of the 
market. The market price of risk is an implied value related to the permit prices, this 
requires an extension of the univariate permit pricing model to a bivariate one. In 
this context, the standard Kalman filter algorithm is considered to be an effective 
way to calibrate to the historical prices. We apply this methodology and estimate the 
implied risk premia. Once the risk premia have been determined, EUA option prices 
can be derived to fit the bivariate model setting, which helps to improve the accuracy 
of the pricing. 

This paper is organized as follows. In Sect. 2 we introduce the basic reduced-form 
model based on a risk-neutral framework. We calibrate the model with an extended 
data series and compare the calibration results. In Sect. 3 an extended bivariate pricing 
model will be introduced in order to capture the market information of the risk premia. 
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We demonstrate how to calibrate the extended model by applying the standard 
Kalman filter algorithm, we present the estimation results of this procedure and 
discuss the model fit. In Sect. 4 we evaluate the option pricing performance by taking 
into account the calibration results of the bivariate model. Section 5 concludes. 


2 Univariate EUA Pricing Model and Parameter Estimation 


The basic reduced-form model based on a risk-neutral framework was introduced 
by Carmona and Hinz [2]. Here, the aggregated normalised emissions are modelled 
directly and it can be shown that the emission certificate futures process solves a 
stochastic differential equation. In this section we give a brief introduction to the 
model and discuss the quality of the model calibration. 


2.1 Univariate Model 


We consider an emission trading scheme with a single trading phase with horizon 
[0, T]. The price evolution of emission permits is assumed to be adapted stochastic 
processes on a filtered probability space (2, F, (.7;);epo,rj, P) with an equivalent 
risk-neutral measure Q ~ P. Based on the assumption of a market compliance 
at time T the price has only two possible outcomes, namely zero or the penalty 
level. The argument is as follows. If there are sufficient emission allowances in 
the market to cover the total emissions at compliance time, surplus allowances will 
become worthless. Otherwise, for undersupplied permits the price will increase to 
the penalty level. 

We introduce the reduced-form model of Carmona and Hinz [2]. Here the nor- 
malized futures price process is a martingale under Q given by 


ar = Er yl|Fil, t € (0, TI. (1) 


The process (/7):e[o,r] denotes the aggregated normalized emission, and is assumed 
to follow a lognormal process given by 


t m 1 rt _2 
T; = Melo %4Ws-2 foods T) e (0, 00), 


where o; stands for the volatility of the emission pollution rate. t —— o; is a determi- 
nistic function which is continuous and square-integrable. (Wo te[0,7] is a Brownian 
motion with respect to Q. Carmona and Hinz [2] prove that a; solves the stochastic 
differential equation 2 

da, = (6 (a) /zdW,, Q) 


344 Y. Wen and R. Kiesel 
with the function t — z;, t € (0, T), given by 


2 
Or 


ra o2du 


(3) 


Zt = 


In order to calibrate the model, Carmona and Hinz [2] suggest to use the function 
z-—pT-1^, 
with a € R and £ € (0, oo), so one has 


da, = 9'($ (aj) PT — t) * AW,. (4) 


To estimate the parameters one has to determine the distribution of the price variable. 
For this purpose one considers the price transformation process 4; defined by a; = 
$4 (£j), where ® denotes the cumulative distribution function of the standard normal 
distribution. By applying Itó's formula one has 


1 
dë, = (So + van) dt + /zdW,, (5) 


where W, denotes the Brownian motion under the objective measure P and h is the 
market price of risk which is assumed to be constant. Moreover, it can be shown that 
&; is conditional Gaussian with mean jz; and variance o? so that the log-likelihood 
can be calculated and Maximum-likelihood estimation can be applied to find the 
model parameters. 


2.2 Estimation 


We calibrate the model to different emission trading periods during the first and 
second EU ETS trading phases. We consider the daily prices of the EUA futures 
with maturities in December from 2005 to 2012. Their historical price series are 
shown in Fig. 1. 

The price transformation é; is conditional Gaussian with its mean us and vari- 
ance gf. We consider the daily historical observations of the EUA futures at time 
t1, t2,..., tn. Their corresponding price transformations can be determined using the 
definition a; = ®(&,). Thus the parameters o, 6, h can be estimated by maximizing 
the log-likelihood function given by 


n = ph, at, 2 
T T»! Gr I “em Zo (6) 


i-l 
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Fig. 1 Historical prices of the EUA futures with maturities in December from 2005 to 2012 


Under the model assumptions the residuals 


£j m Lith, a, B) 
Vo; (a. B) 


must be a series of independent standard normal random variables. So standard 
statistical analysis can be applied to test the quality of the model fit. We show our 
estimation results in Table 1. The horizons of the price data are two years, starting 
from the first trading day in January of the previous trading year to the last trading 
day in December of the next year. 

Comparing the estimation values in Table 1, the instability of the parameter va- 
lues in each cell for different time periods can be observed. Note the value for the 
market price of risk changes its sign during the first and second trading phase. This 
implies the inappropriateness of the assumption for a constant market price of risk. 
The fourth value in each cell is the negative of LLF. Note the -LLF are much lower 
after the first trading phase because the price collapse during 2006 to 2007 affects 
the data partially. 

From Figs. 2, 3, 4, 5 and 6 we display the time series of the residuals w;, their empir- 
ical auto-correlations, empirical partial auto-correlations and quantile-quantile-plots. 
We choose EUA futures with maturity in December 2007 (EUA 07) and EUA futures 
with maturity in December 2012 (EUA 12) as examples. 

The time series w; show an effect of volatility clustering. This is confirmed by 
significant values to high lags in the sample autocorrelation and sample partial auto- 
correlation. Also the Q-Q plots, especially for the first trading phase, indicate heavy 
tails and a non-Gaussian behavior. A formal analysis with an application of Jarque- 
Bera test rejects the hypothesis that the data set is generated from normally distributed 
random variables. In order to improve the model fit we extend the model by intro- 
duction of a dynamic market price of risk in Sect. 3. 


wi = 


i=l,...,n, (7) 
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Sample Autocorrelation Function 


Fig. 2 Statistical analysis of EUA 07, time period 05—06 and 06-07 
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Fig.3 Statistical analysis of EUA 12, time period 05-06 and 06-07 


residuals 


Sample Partial Autocorrelations 


Sample Autocorrelation Function 


5 r1 1 
5 
5 
E 05 
8 
D E 
Ha III 
è 
H 
-5 9 95 
0 200 400 600 0 5 10 15 20 
time in days. Lag 
Sample Partial Autocorrelation Function QQ Plot of Sample Data versus Standard Normal 
d$ ———————————————À4 24 
è + 
Bo 3 
05 3 
=o 
* 
opnEREÉ c 2 
-05 g aL st 
0 5 10 15 20 -4 -2 0 2 4 
Lag Standard Normal Quantiles 


Fig. 4 Statistical analysis of EUA 12, time period 07—08 and 08-09 
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Fig. 5 Statistical analysis of EUA 12, time period 09-10 and 10-11 
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Fig.6 Statistical analysis of s 
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3 Bivariate Pricing Model for EUA 


The evidence in the previous section shows that the market price of risk is actually 
time varying rather than constant. In order to illustrate the dynamic property of the 
market price of risk we consider a bivariate permit pricing model in this section. 


3.1 Model Description 


We model the market price of risk as an Ornstein-Uhlenbeck process as its value 
can be either positive or negative and denote it by A;. Recall the equation for the 
normalized price process under the risk-neutral measure ( given by (2). According 
to Girsanov’s theorem, the bivariate pricing model under the objective measure P is 
given by 


da, = $'(5^ (a) /z adt + dW), 
dÀ, = OÀ — Aydt + od W?, 
dW} dW? = pdt, 
where W} and W? are two one-dimensional Brownian motions with correlation coef- 
ficient p. Note that under the model assumptions, the filtration (.Z;) in the probability 
space must be assumed to be generated by the bivariate Brownian motion. 


The use of Girsanov’s theorem in the bivariate model requires the condition that 
the process Z; given by 


t 1 t 2 
Z =exp( T AdW,— 5 [i ids) (8) 


is a martingale. A sufficient condition for (8) is Novikov's condition: 


j| exp G [ xia; | « oo. (9) 
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Under our model assumptions, this condition is always satisfied.! 

To calibrate the model we use the transformed price process to avoid complex 
numerical calculations in the calibration procedure. The bivariate model can be refor- 
mulated as 


1 
dé, = (Sa + Van) dt + /zdW; , (10) 
dÀ, = 6( — Aydt + o,dW?, (11) 
dW} dW? = pdt. (12) 


In (11), A represents the long-term mean value. 0 denotes the rate with which the 
shocks dissipate and the variable reverts towards the mean. o; is the volatility of the 
market price of risk. According to Carmona and Hinz [2], the price transformation 
is conditional Gaussian and its SDE can be solved explicitly. 


3.2 Calibration to Historical Data 


We consider the discretization of the model (10)-(12). By assuming the constant 
volatility terms in the time interval [t;_1, tg], the model equations can be discretized 
under Euler's scheme given by 


1 
£n ES AZ Athy; + (1 + - En + af tte MEQ (13) 
Ay = (1— OAD Ay_, + OAL + oV AE, (14) 


Cov(&], &) = p, (15) 
where At = ty — (ty 1), namely the time interval, and Eqs E ~ MN (0, 1). z can 
be modeled by using the function £ (T — t;) ^. The model parameter-set is therefore 
y = [0, à, 05, p, a, B]. 

AS A4, is a hidden state variable related to the price transformation, and only values 
of £y at time points t1, t2, ... , t, can be determined from the market observations, the 
market price of risk series can be estimated by applying the Kalman filter algorithm. 
We have chosen to use the transformation process instead of the normalized price dy, . 
Because of the linear form of Eqs. (13) and (14) the standard Kalman filter algorithm 
is considered to be an efficient method for the model calibration. A detailed procedure 
to apply the standard Kalman filter can be found in [7]. To apply the Kalman filter 
model Eqs. (13)-(15) must be put into the state space representation to fit the model 
framework. The measurement equation links the unobservable state to observations. 
It can be derived from (13) and (14). After some manipulations, the equations of the 


! A proof can be found in Appendix 1. 
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state space form for the model can be rewritten? as 


1 - 
Sy = V B(T — t) * Atay, + ( uii wW) En + V B(T — t) Até, 


(16) 

and 

= op 1 = 
Ay = | 0AAt-— — — — (( + SBT — t) )) Eni T &) 
VET — 1) 2 

+A —6At — op AD), Hov (1 — 92) At, (17) 
where él and eL are independent, standard normally distributed random variables. 
For the estimation of the parameter vector y = [0, A, 05, p, a, D] consider 
the variable &;,. In each iteration of the filtering procedure, the conditional mean 
SEn lEn ..., En] and the conditional variance Var (En |&,,..., &j ,) can be cal- 


culated. We denote mean and variance by uy (Y) and X, (V), respectively. The joint 
probability density function of the observations is denoted by f (5; | V) and is given 
by 


: n 1 (En, — Ht wr) 
Peni Homo) 


where &;,,, summarize the observations from &, to &,,. Its corresponding log- 
likelihood function is given by 


on _1x 1 s^ Ey = a OY 
obs (Y lEn) = 2 log 2x 2 2. log Xn (V) 2 2. X (yr) i 


(18) 


The estimation results, their standard errors, t-tests and p-values can be found in 
Table 2. Figure 7 shows the estimation results of the market price of risk, compared 
with the price transformation and the historical futures price. In Fig.8, a negative 
correlation between the price transformation and the market price of risk can be seen. 
The market price of risk is the return in excess of the risk-free rate that the market 
wants as compensation for taking the risk.? It is a measure of the extra required rate 
of return, or say, a risk premium, that investors need for taking the risk. The more 
risky an investment is, the higher the additional expected rate of return should be. 
So in order to achieve a higher required rate of return, the asset must be discounted 
and thus will be sold at a lower price. Figure 8 reveals this inverse relationship. 

Moreover, we use the mean pricing errors (MPE) and the root mean squared errors 
(RMSE) given by 


>For a derivation of the state equation see Appendix 2. 
For an economical explanation see [11], Chap. 27. or [17], Chap. 30. 
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Table 2 Test of model parameters at significance level 5 %, sample size 1536 


Parameter Coeff Std Err t-test p-value 
0 5.7601 0.0000 
n 4.0641 0.0001 
O 17.6365 0.0000 
p 9.0910 0.0000 
a 61.5603 0.0000 
B 0.0172 0.0005 35.6312 0.0000 
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Fig.7 MPR, futures price and price transformation from Jan. 2007 to Dec. 2012 
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Table3 Performance of MPE and RMSE with 2000 observations 


353 


Maturity MPE RMSE 
1 month —0.0153 0.0182 
3 months —0.0208 0.0234 
6 months —0.0366 0.0397 
9 months —0.1273 0.1302 
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Fig.9 Statistical tests for the residuals in trading phase 2 


respectively, to assess the quality of model fit. Here N denotes the number of obser- 
vations, Esc is the estimated price to maturity T, and &;, ; is the observed price. Their 
values can be seen in Table3. The absolute values of MPE and RMSE increase with 
time but still remain very low even 9 months before the maturity. Therefore, the 
conclusion is that the model is able to reproduce the price dynamics. 

In Fig.9 we show the standard statistical test results of the residuals by taking into 
account the dynamic market price of risk. Comparing with the results from Figs. 2, 3, 
4, 5 and 6, the time series of the residuals is relative stable with smaller variance. The 
sample auto-correlations and sample partial auto-correlations reveal very weak linear 
dependence of the variables at different time points. Also, the Q-Q plot indicates a 
better fit of a Gaussian distribution. 


4 Option Pricing and Market Forward Looking Information 


A general pricing formula of a European call is given by 


Ci=e L rsds 


EQ t(A, — K)*|.Z;], 
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where {rs}se[o,7] stands for a deterministic rate, A; denotes the futures price, K > 0 
is the strike price, and t € [0, T] is the maturity. The normalized price process a; is 
given by a; — A;/P, where P denotes the penalty for each ton of exceeding emissions, 
and therefore we have A; = PØ (£j). A call option price formula written on EUA has 
been derived by Carmona and Hinz [2] under the assumption of a constant market 
price of risk. Under the assumption of a dynamic market price of risk, the option 
price formula is coherent with the formula in [2] given by 


C, — e f ns ii (P®(x) — Kj pino ds, 


where ¢ stands for the density function of a standard normal distribution. Here r,r 
and BE are the parameters of the distribution of £j, which is conditional Gaussian. 
Under the risk neutral measure Q, us: and 02. are given by 


T 
1 d Z id Z 
Mtr = ei^ zisg, of = J zels zudi qs. 
t 


In the following example, the penalty level is P = 100, the initial time t = 0 
starts in April 2005. EUA futures has maturity T on the last trading day in 2012. The 
European calls written on EUA futures with a strike at K = 15 and maturity T will 
be considered under a constant interest rate at r = 0.05. Figure 10 shows the call 
option prices and the futures prices. The red curve stands for the option prices under 
dynamic MPR while the green curve stands for the option prices under constant 
MPR. 

To measure the impact of the dynamic market price of risk on the EUA option for 
different strikes we calculate the option price in the univariate and bivariate model 
setting respectively. Durations from 1, 3, 6 and 12 months to maturity are chosen for 
calls written on EUA 2012. The results are plotted in Fig. 11. The red curve stands for 
the option prices evaluated by the bivariate model and the blue curve by the univariate 
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Fig. 10 Futures price and call option prices with K = 15 from 2005 to 2012 
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Fig.11 Calloption prices comparison for durations of 1,3, 6, 12 months on EUA 2012 for different 
strikes 


model. The green line is the corresponding futures price at the given time. In most 
cases, one is interested in the option prices near the underlying price. According to 
the figure, the option prices in different model settings coincide except for a interval 
around the corresponding futures. In a short time before the maturity of EUA 2012, 
Fig. 11 shows a price overestimation by the constant MPR. This result is consistent 
with the result shown in Fig. 10, where we take K = 15 as a sample path. 
Moreover, one notes that the call price process with constant MPR develops below 
the call process with dynamic MPR in the first trading phase before 2008 and then 
increases slowly and moves to the upside of the call process with dynamic MPR dur- 
ing the second trading phase, before both processes vanish to the maturity because of 
lower underlying prices. The reason for the price underestimation before 2008 and 
overestimation thereafter can be explained as the assumption of a constant MPR in 
the whole trading periods and thus causes a neglect on the information of the market 
participants. Due to the regulatory framework of the carbon market, certificates carry 
information on the market participant expectations on the development of the fun- 
damental price drivers. Since the implied risk premia increase with time and exceed 
their 'average' level in 2008, asset price must be discounted to compensate the higher 
risk. By using appropriate valuation models, this risk premia and the forward-looking 
information carried by prices of futures and options of certificates can be extracted. 


5 Conclusion 


We extract forward-looking information in the EU ETS by applying an extended 
pricing model of EUA futures and analyzing its impact on option prices. We find 
that the implied risk premium is time-varying and has to be modeled by a stochastic 
process. Using the information given by the risk premium we show that the option 
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prices during the first and second trading phases are underestimated and overesti- 
mated, respectively. The reason for the pricing deviation is caused by the assumption 
of a constant market price of risk which rigidifies the market participant expectations 
on the development of price drivers. The over- and underestimated prices are mostly 
concentrated in the interval including the corresponding futures, which is the area 
where the price most likely will evolve in the future. Although there is not a closed 
form for the option pricing formula, a simple numerical approach can be used to 
determine the price. 


Open Access This chapter is distributed under the terms of the Creative Commons Attribution 
Noncommercial License, which permits any noncommercial use, distribution, and reproduction in 
any medium, provided the original author(s) and source are credited. 


Appendix 1 
In order to show the condition in (8), it is sufficient to prove the Novikov's condition 


given by 
1 T 
z| exp (5 / xias) | « oo. 
2 Jo 


In the bivariate EUA pricing model, where À+ follows a Ornstein-Uhlenbeck-Process 
given by 


hy = O(A — Apdt + o3dWt, 


this condition is always satisfied. 


Proof We first show that there exists a constant € > 0 such that for any S c [0, T], 


we have 
1 
[ exp (5 f 


To show (19) we consider the term in the expectation notation. By applying Jensen’s 
inequality we have 


] ste S+E ] ẹ 
exp Gf Xd) — exp (f - Adi) 
2 S S £2 
] ste ] ste 
= exp HI -Ajdo) < 3 exp (2z2)a. 
€ Js 2 € Js 2 


By applying Fubini's theorem (19) becomes 


- [ m | exp (597) Jar. (20) 


S+ e 


pdt) | < oo. (19) 
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The process A; is a Gaussian process with mean and variance given by 


BA] = ui = Ape E A(1 — e 95, 


c2 
Var(A;) = 02 = 2 (1 — e”). 


We have Ay ^ N (lr, d Now let Z be a standard normal-distributed random 
variable Z ~ M (0, 1). So in (20) we have 


[eo C] nee Gen) 


g en? £022? 
= E| exp (t+ emoz + 77—)] 


en? eo?x? N 1 X 
= B (SE + emis + 2 aor ( ;)e 
2 2 
— Ely ) i. 1 ( l= eof 4 Ja 
exp ( 2 a Vin? 2 x^ + EUtOtxX Jax. 


To calculate the integration term above, let a; — 1 — eo? and b; = £ u01, and make 
the integral-substitution. Then we have 


1 1 — £02 
f exp ( Pol xi £1101) dx 
R 4/27 2 


= [a ni 2h) = 


= Fah rao Ca C Ge 


2 (y - ty? 
= = exp ($4) f, I a Jay 


1 b? 
= —— exp (5). 
A/ at 2a; 


According to the assumptions a; = 1 — eo? is positive and the expectation is con- 
vergent for a small & and its value is 


2 2 
[exp (533 SE exp ( 2 exp a 


1 2 2,242 


€, E [Aot 
= exp ( 2 exp (7-5 š 
m t 
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Thus the integral in (20) is finite and the exponential term in (19) is integrable. 

In order to show Z; is a martingale we first consider that Z; is a local martingale, 
hence it is a supermartingale. Therefore, Z, is a martingale if and only if the condition 
“[Z;] = 1, Vt € [0, T], is satisfied. This martingale property can be shown by 
induction. Suppose E[Zo] = 1 which is trivial and E[Z,] = 1 for t € [0, S] for 
S < T. Let now t € [S, S + £] and set 


t 1 t 
Zi = exp ( f AdW, — = f itd). 
j 2 Js 


According to Novikov condition and (19), Zi is a martingale. Then we have 


AZA] = E[ZsZ$] = E[E[ZsZ§]|_¥s] = E[ZsE[Z5|.¥s]] = EIZsZ$] = EIZs], 


since 


S 1 S 
E. = exp (f AsdW, — J Ads) = exp(0) = 1. 
S S 


It follows 


ZJ = E[Zs] — 1 fort e [$, S+ e]. 


Then we have IE[Z,] = 1 for t € [0, S + e]. Repeat this induction for rs times 
we have E[Z,] = 1 for f € [0,7], which implies Z; defined in (8) is a 
martingale. 


Appendix 2 


The bivariate EUA pricing model can be described as follows: 


1 
En = zy Atn + (1 + stu) Bn y Zy AAT 


An = (1 — 0At) + ÓXAL ory AL, 
1 @2 
Cov(£l, 62) = p, 
where El and eL are both random variables of the standard normal distribution. We 
want to put the model into the state space form. Price transformation depends on 


the current level of the market price of risk, which is an unobservable variable and 
therefore must be modeled in the equation of A;,. We first let 


ép me Ge — Vl - pee + Ey, 
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where él and e are both random variables of the standard normal distribution as 
well. This fact can be easily seen since we have 


7 7 Ez _ pE e " p&} 
Cov(&l ; E2) = cm (sh. —*___k |= cov &, tt) + Cov gl pe k =0. 
k k k /1 — p2 k /1 — p2 k /|1 — p2 


Note that 


1 
V a Atay + (1 + zik- j E, ty Zn Até] — Er = 0. 


Multiplying —o5 P (Z) ? at the both sides of the equation and sum itto the equation 
of Àp, it follows that 


Ay = (0 —06A0X4 , — 040 AUa, , ÓXAL, 


op 1 2 1 
TUR (( + z2- ) En) -&) t oy Atép — OV Atpó, 


= op 1 
= (1 — 0A^t — 040 A^t)Àg , + OXAt — 7 1+ 77k- 1 En 1 — En 
tk—1 
tou AtV1-— pP Ep. 


This is the transition equation in the state space form, and the measurement equation 
would be 


1 
Sy = Sy = Sep Atay, + ( + ju) En + 2 AtE 
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